THE PHILOSOPHICAL MAGAZINE

FIRST PUBLISHED IN 1798

VOL. 42 SEVENTH SERIES No. 330 July, 1951

A Journal of Theoretical Experimental and Applied Physics


EDITOR
PROFESSOR N. F. MOTT, M.A., D.Sc., F.R.S.


EDITORIAL BOARD
SIR LAWRENCE BRAGG, O.B.E., M.C., M.A., D.Sc., F.R.S.
ALLAN FERGUSON, M.A., D.Sc.
SIR GEORGE THOMSON, M.A., D.Sc., F.R.S.
PROFESSOR A. M. TYNDALL, C.B.E., D.Sc., F.R.S.


PRICE 12s. 0d.

Annual Subscription £6 0s. 0d. payable in advance.

PRINTED AND PUBLISHED BY TAYLOR & FRANCIS LTD., RED LION COURT, FLEET ST., LONDON E.C.4.




LXX. Anelastic Measurements of Diffusion Coefficients in F.C.C.
Substitutional Solid Solutions.


By A. D. Le Claire,
Atomic Energy Research Establishment, Harwell *. 


[Received April 3, 1951.] 


SUMMARY. 


It is shown in this paper how in face-centred cubic substitutional solid 
solutions of metals, measurements upon that form of anelasticity which is 
associated with the preferential reorientation of pairs of solute atoms in the 
presence of a tensile stress, may be used to obtain the diffusion coefficient of 
the solute atoms. The method is closely analogous to the one employed 
recently by Zener and by Wert to measure diffusion coefficients in
interstitial solid solutions.

The theory is applied to some previously reported anelastic measurements
on (70/30) α-brass by Zener and by Ké to give the diffusion coefficient
of zinc in the brass. The results show good agreement with directly
measured and suitably corrected diffusion coefficients when these are
extrapolated to the lower temperatures at which the anelastic
measurements were made.

There follows a discussion of the advantages and disadvantages of this 
method of measuring diffusion coefficients and of some possible further 
developments of the theory. 


§1. INTRODUCTION. 


SEVERAL recent papers (Wert 1950a,b, Wert and Zener 1949) have made 
use of the possibility of calculating the diffusion coefficient of solute atoms 
in interstitial solid solutions from measurements on the anelasticity of such 
solutions associated with the siting of solute atoms on preferred positions in 
the presence of an applied stress. Such calculations provide values of the 
diffusion coefficient at much lower temperatures than those at which direct 
measurements are possible. It is the purpose of this paper to show that the
same can be done for substitutional solid solutions, by measurements on 
the anelasticity of such solutions associated with the preferential orienta- 
tion of pairs of solute atoms induced by an applied stress. 


§ 2. THE ANELASTICITY ARISING FROM THE PREFERENTIAL ORIENTATION
OF SOLUTE ATOM PAIRS.


In a face-centred cubic solid solution such as α-brass, the zinc and copper
atoms are arranged at random on a face-centred cubic lattice. Since the 
zinc atoms are larger than the copper atoms, each zinc atom will be the 


* Communicated by the Author. 

centre of an elastic distortion in the lattice, which for lone zinc atoms will
have a cubic symmetry. But the distortion around a pair of zinc atoms
will no longer have cubic symmetry and the lattice will be extended more
along the axis of the pair than in any other direction. In the absence of
any stress the pair axes will be distributed at random over all the
crystallographically permissible directions; but if, say, a tensile stress is
applied to the lattice the axes of the pairs of atoms will tend to orient 
themselves along those allowed crystallographic directions which make the 
least angle with the stress direction. A certain time will be required for 
the establishment of this new equilibrium, in which the pair axes are no 


Fig. 1.

[Figure: internal friction $Q^{-1}$ plotted against $1/T$; observations at 620 cycles/sec.]

Internal friction plotted against 1/T for the transverse
oscillations of a single crystal α-brass bar (Zener 1947).


longer distributed at random among all the permissible directions, so that 
the strain will lag behind the stress. This hysteresis will give rise to 
relaxation and other anelastic effects in the specimen, such as internal 
friction. 

Zener (1943) was the first to observe an anelastic effect in α-brass which
he was later able to attribute to this relaxation mechanism (Zener 1947).
Further measurements, also on α-brass, were later made by Ké (1948 a).
Zener measured, as a function of the temperature T, the internal friction,
$Q^{-1}$, of the transverse oscillations of constant frequency of a single crystal




bar of α-brass; Ké made similar measurements on the torsional oscillations
of a coarsely crystalline α-brass wire. Their results are reproduced in
figs. 1 and 2. (The rapid rise in $Q^{-1}$ above about 350° C. in fig. 2 is
associated with relaxation of the stress by grain boundary slip, and does
not concern us here.)

Assuming that the relaxation time $\tau_R$ for the atomic process we have
described depends upon the temperature through the equation

$$\tau_R = \tau_{R_0} \exp(H/RT), \qquad (1)$$

Fig. 2.

[Figure: internal friction ($Q^{-1}$) plotted against temperature of measurement (°C.), from 250 to 425; H = 40,000 calories/mole.]

Internal friction plotted against 1/T for the torsional oscillations of a coarsely
polycrystalline α-brass wire. $f_1$, oscillations at 2.3 c.p.s.; $f_2$, oscillations
at 0.6 c.p.s. (Ké 1948 a).


where $\tau_{R_0}$ is a constant and $H$ the activation energy for the relaxation
process, Zener showed that his results were well represented by the
equation

$$Q^{-1} = \Delta_E \, \frac{\omega \tau_R}{1 + \omega^2 \tau_R^2}. \qquad (2)$$

$\omega$ is the angular frequency at which the measurements of $Q^{-1}$ were made
and $\Delta_E$ is the relaxation strength for the process, defined as

$$\Delta_E = \frac{E_U - E_R}{E_U}.$$

Here, $E_U$ is the unrelaxed Young's modulus, i.e. the modulus measured
in a time $t \ll \tau_R$ by dynamic methods, and $E_R$ the relaxed modulus measured
after a time $t \gg \tau_R$.

Since equation (2) represents well the experimental results, we can infer
from formal anelasticity theory (Zener 1948) that the relaxation process is
characterized by a single relaxation time and that the rate at which the
distribution of solute atom pair axes approaches equilibrium is given by
the simple logarithmic relation

$$\frac{d\,\Delta N}{dt} = -\frac{\Delta N}{\tau_R}, \qquad (3)$$

where $\Delta N$ is the number of solute atom pairs whose axes must still reorient
for equilibrium to be attained.

Since the atomic process involved in the relaxation is clearly the
migration of one member of a pair of zinc atoms from one lattice position
to another, that is a diffusion process, the assumption expressed by
equation (1) is a very reasonable one, for diffusion rates are always found
to vary exponentially with 1/T. For the same reason $\tau_R$ will be closely
related to and of the same order of magnitude as $\tau$, the mean time of stay
of an atom of zinc on one lattice position. We shall make use of equation
(3) in deriving the exact relation between $\tau$ and $\tau_R$ and then, having obtained
a relation between $\tau$ and the diffusion coefficient D, we shall be in a position
to calculate D from measured values of $\tau_R$.

The curves in figs. 1 and 2 show a maximum at a well defined temperature,
as always occurs when the extent of the relaxation is limited (Zener
1943). From equation (2) we see that provided $\Delta_E$ is independent of T,
the internal friction is a maximum for that temperature at which the
corresponding relaxation time $\tau_R$ is given by

$$\tau_R \omega = 1. \qquad (4)$$


From the maxima of the three curves in figs. 1 and 2 we can obtain three
values of $\tau_R$, and therefore of D, corresponding to the temperatures at which
the maxima occur. The heights of the maxima of the two curves in fig. 2
are sufficiently equal for us to assume that over the temperature range
covered by each peak $\Delta_E$ is sufficiently independent of temperature for the
relaxation time at the temperature of the maxima to be given by
equation (4). Further, the position of the maximum is not affected by the
superposition (in fig. 2) of the internal friction due to relaxation by grain
boundary slip. Equation (2) represents a function symmetrical with respect
to its maximum at $\tau_R \omega = 1$, so that we can separate out the high temperature
portions of the full line curves in fig. 2 into the contributions from grain
boundary slip and from the preferential reorientation of solute atom pairs.
When this is done it is found that at the temperatures of the maxima, the
variation with temperature of the contribution from grain boundary
relaxation is negligible.
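
The two properties of equation (2) used above (a maximum of height $\Delta_E/2$ at $\tau_R \omega = 1$, and symmetry about that maximum when plotted against the logarithm of $\omega \tau_R$) can be checked numerically. The following is a minimal sketch only; the value of `DELTA_E` is a hypothetical relaxation strength chosen for illustration and is not taken from the measurements.

```python
def internal_friction(omega_tau, delta_e):
    """Equation (2): Q^-1 = Delta_E * (omega*tau_R) / (1 + (omega*tau_R)^2)."""
    return delta_e * omega_tau / (1.0 + omega_tau ** 2)


def peak_location(delta_e, n_points=2001, decades=3.0):
    """Scan omega*tau_R on a logarithmic grid; return (omega_tau, Q^-1) at the maximum."""
    best_x, best_q = None, -1.0
    for k in range(n_points):
        log_x = -decades + 2.0 * decades * k / (n_points - 1)
        x = 10.0 ** log_x
        q = internal_friction(x, delta_e)
        if q > best_q:
            best_x, best_q = x, q
    return best_x, best_q


DELTA_E = 0.01  # hypothetical relaxation strength, for illustration only
x_peak, q_peak = peak_location(DELTA_E)

# Symmetry about the peak: omega*tau_R = x and 1/x give the same internal friction.
left = internal_friction(1.0 / 3.7, DELTA_E)
right = internal_friction(3.7, DELTA_E)
```

Since $\tau_R$ varies exponentially with 1/T by equation (1), symmetry in the logarithm of $\omega \tau_R$ is what makes the peak symmetrical when plotted against 1/T.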




§ 3. THE RELATION BETWEEN D AND $\tau$.


Consider diffusion taking place in the face-centred cubic lattice along the
[100] direction. If $a$ is the lattice parameter, then there are successive
planes of atoms, the (200) planes, at distances of $a/2$. Each atom in any
one of these planes may jump into any one of the twelve nearest neighbour
positions. Four of these twelve possible jumps take the atom into the
next (200) plane a distance $+a/2$ measured along the $x$ direction, another
four into the next (200) plane a distance $-a/2$ along the $x$ direction, while
the remaining four jumps lead to no movement along the $x$ direction.
Suppose now that a concentration gradient exists in the lattice so that
there are $(\alpha + \Delta\alpha)$ atoms of one type per unit area on one (200) plane and
$(\alpha - \Delta\alpha)$ atoms per unit area on the next adjacent (200) plane, a distance
$a/2$ away. If $\tau$ is the average time of stay of an atom on a lattice site, so
that each atom jumps $1/\tau$ times per second, the nett flow of atoms down the
concentration gradient will be

$$(\alpha + \Delta\alpha)\frac{1}{3\tau} - (\alpha - \Delta\alpha)\frac{1}{3\tau} = \frac{2\,\Delta\alpha}{3\tau},$$

since of the $1/\tau$ jumps per second, only 1/3 carry the atom forward and 1/3
carry it backward along the $x$ direction.
But each plane occupies a thickness $a/2$, so that the two planes differ in
volume concentration by $4\,\Delta\alpha/a$, and the concentration gradient is given by

$$\frac{4\,\Delta\alpha}{a} \Big/ \frac{a}{2} = \frac{8\,\Delta\alpha}{a^2},$$

so that the diffusion coefficient D will be given by

$$D = \frac{a^2}{12\tau}. \qquad (5)$$

The same result is of course obtained along whichever axis one considers
the diffusion to be taking place.

Equation (5) can be more readily obtained by making use of a general
result of random walk theory (Chandrasekhar 1943). If the probability of
a migration, by diffusion, of amount and direction r is given by R(r), then
the diffusion coefficient for diffusion say along the $x$ axis is given by

$$D = \frac{1}{2\tau}\langle x^2 \rangle,$$

where $\langle x^2 \rangle$ is the second moment of R about $x$. In a face-centred cubic
lattice R is zero in all but 12 directions and for all values of r except
$r = a/\sqrt{2}$. Its second moment about $x$ is then readily seen to be $a^2/6$, so
that we arrive again at the expression for D given in equation (5).
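
The second moment quoted above can be verified by listing the twelve nearest-neighbour jumps explicitly. The sketch below works in exact fractions, with lengths in units of $a$ and times in units of $\tau$; the function and variable names are illustrative only.

```python
from fractions import Fraction
from itertools import product


def fcc_nearest_neighbour_jumps():
    """Return the 12 nearest-neighbour jump vectors of an f.c.c. site, in units of a."""
    half = Fraction(1, 2)
    jumps = []
    for v in product((-half, 0, half), repeat=3):
        # A nearest-neighbour jump has exactly one zero component,
        # e.g. (1/2, 1/2, 0), and so has length a / sqrt(2).
        if sum(1 for c in v if c == 0) == 1:
            jumps.append(v)
    return jumps


def second_moment_x(jumps):
    """Mean of x^2 over equally probable jumps, in units of a^2."""
    return sum(v[0] ** 2 for v in jumps) / len(jumps)


jumps = fcc_nearest_neighbour_jumps()
forward = sum(1 for v in jumps if v[0] > 0)   # jumps carrying the atom to +a/2
x2 = second_moment_x(jumps)                   # <x^2> in units of a^2
d_coefficient = x2 / 2                        # D = <x^2> / (2 tau), in units of a^2 / tau
```

Four of the twelve jumps carry the atom forward, which is the factor 1/3 used in the first derivation, and `d_coefficient` reproduces the numerical factor of equation (5).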


§ 4. THE RELATION BETWEEN $\tau$ AND $\tau_R$.

The relation between $\tau$ and $\tau_R$ depends to some extent upon the direction
of the stress applied to the crystal. We shall derive in detail the relation
for one principal direction of applied tensile stress and then more briefly
the relation for two other principal directions.




(a) Tensile stress applied along [111] direction. 


The pair axes of the solute atoms may be along any of the six ⟨110⟩
directions. Of these six directions, three, namely [011], [101] and [110],
have a direction cosine of $(2/3)^{1/2}$ with respect to the [111] axis, and we shall
refer to these as the "p class" of pair axes. The remaining three, $[0\bar{1}1]$,
$[\bar{1}01]$ and $[1\bar{1}0]$, are normal to the [111] direction and will be referred to as
the "n class" of pair axes. A tensile stress along the [111] direction will
then tend to reorient pair axes from the n class into the p class, i.e. tend
to bring pair axes more nearly parallel to the stress axis.

Now if we assume a vacancy mechanism for the diffusion, a solute atom
pair can only reorient itself, remaining a pair, when a vacancy occupies a
position which is adjacent to the positions of each of the members of the
pair. It is readily shown that if in this circumstance one member of the
pair changes places with the vacancy, i.e. diffuses, then the probability is
one half that the concomitant reorientation of the pair leads to a change of
the pair from one class (p or n) to another. That is to say, only one half

Fig. 3. 


Face-centred cubic lattice. 


of those diffusion processes which lead only to a change of orientation of
the pair (and not a break up of the pair) will lead to a relaxation of a stress
along the [111] direction. This can easily be seen by considering all
possible initial positions of pairs of atoms and of vacancies. Take for
example the case of the possible reorientations of an [011] pair, a member of
the p class, composed of atoms at (000) and $(0\,\tfrac{1}{2}\,\tfrac{1}{2})$. There are four possible
positions for a vacancy which is to be a common nearest neighbour of the
pair of atoms, and with which either of the atoms may change places so as
to reorient the pair. These are given in the first column of Table I; the
second and third columns show the class of the pair after it has been
reoriented by a migration of the (000) and $(0\,\tfrac{1}{2}\,\tfrac{1}{2})$ atoms respectively.

As can be seen, only one half of the reorientation processes lead to a change 
in the class of the pair. The same result will be true for the other members
of the p class, for the axes of all members of this class are symmetrically 
disposed with respect to the [111] axis. A similar study of the possible 




reorientations of a member of the n class leads to the same conclusion, 
that only half the reorientations lead to a change in the class of the pair 
considered. 
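
The counting argument above can be reproduced by enumeration. The sketch below places a pair at two nearest-neighbour sites, finds the lattice sites adjacent to both atoms, and records the class of the pair after either atom jumps into such a site. Positions are in units of the lattice parameter; the function names are illustrative, and the class test (axis normal to [111] or not) applies only to stress along [111].

```python
from fractions import Fraction
from itertools import product

HALF = Fraction(1, 2)
# The 12 nearest-neighbour vectors of an f.c.c. site, in units of the lattice parameter.
NEIGHBOUR_VECTORS = [v for v in product((-HALF, 0, HALF), repeat=3)
                     if sum(1 for c in v if c == 0) == 1]


def add(u, v):
    return tuple(a + b for a, b in zip(u, v))


def sub(u, v):
    return tuple(a - b for a, b in zip(u, v))


def pair_class(axis, stress=(1, 1, 1)):
    """'n' if the pair axis is normal to the stress axis, otherwise 'p' ([111] stress only)."""
    return "n" if sum(a * s for a, s in zip(axis, stress)) == 0 else "p"


def common_neighbours(atom_a, atom_b):
    """Sites that are nearest neighbours of both members of the pair."""
    near_a = {add(atom_a, v) for v in NEIGHBOUR_VECTORS}
    near_b = {add(atom_b, v) for v in NEIGHBOUR_VECTORS}
    return sorted(near_a & near_b)


def reorientations(atom_a, atom_b):
    """List (vacancy, class after atom_a jumps, class after atom_b jumps)."""
    rows = []
    for vacancy in common_neighbours(atom_a, atom_b):
        after_a = pair_class(sub(atom_b, vacancy))  # new pair: vacancy site and atom_b
        after_b = pair_class(sub(vacancy, atom_a))  # new pair: atom_a and vacancy site
        rows.append((vacancy, after_a, after_b))
    return rows


def fraction_changing_class(atom_a, atom_b):
    start = pair_class(sub(atom_b, atom_a))
    outcomes = [c for _, a, b in reorientations(atom_a, atom_b) for c in (a, b)]
    return Fraction(sum(1 for c in outcomes if c != start), len(outcomes))


ORIGIN = (0, 0, 0)
p_pair = fraction_changing_class(ORIGIN, (0, HALF, HALF))    # [011] pair, p class
n_pair = fraction_changing_class(ORIGIN, (0, -HALF, HALF))   # pair normal to [111], n class
```

Both `p_pair` and `n_pair` come out as one half, and `reorientations(ORIGIN, (0, HALF, HALF))` lists the four vacancy positions with the class changes for the [011] pair.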

Consider again the possible migration paths of the members of any pair 
of solute atoms. Each solute atom has four neighbours in common with 
the other and seven neighbours not in common. Each solute atom may 
migrate into any of the 11 positions available to it and we assume that 
vacancies occur on the 11 positions with equal probability. 

The probability per unit time that one atom moves $= 1/\tau$. The
probability that when it moves it occupies one of the four common sites,
so that the pair remains a pair but is reoriented (i.e. the probability of
reorientation by migration of one atom), $= 4/11\tau$.

Similarly, the probability of reorientation by migration of the other
atom $= 4/11\tau$.

Thus the probability per unit time of a reorientation of the pair while
remaining a pair $= 8/11\tau$. Therefore the probability per unit time of a
reorientation leading to a change in class of the pair $= 4/11\tau$.


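TABLE I.
Reorientations of a [011] (p-class) Solute Atom Pair.

Pair axis originally [011].

Position of common                   Class of pair axis after
neighbour vacancy                    (1) Migration of        (2) Migration of
                                     atom at (000)           atom at $(0\,\tfrac{1}{2}\,\tfrac{1}{2})$

$(\tfrac{1}{2}\,\tfrac{1}{2}\,0)$                   n $[\bar{1}01]$             p [110]
$(\bar{\tfrac{1}{2}}\,\tfrac{1}{2}\,0)$             p [101]                 n $[1\bar{1}0]$
$(\tfrac{1}{2}\,0\,\tfrac{1}{2})$                   n $[1\bar{1}0]$             p [101]
$(\bar{\tfrac{1}{2}}\,0\,\tfrac{1}{2})$             p [110]                 n $[\bar{1}01]$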


Similarly, it can be seen that the probability per unit time of the pair
breaking up, by movement of either atom into one of the seven positions
allowed to each atom, $= 14/11\tau$.

We shall assume that for each pair that breaks up in this way, another is 
formed of the same class at some other point in the lattice by diffusion of 
lone solute atoms toward one another, i. e. the number of pairs of each class 
is only changed by a reorientation process, and the break-up of a pair leads 
to no nett change in the total number of its class. 

In stress free equilibrium let there be $N_p^0$ p pairs of solute atoms and
$N_n^0$ n pairs. Since there are equal numbers of possible p and n directions
we shall have $N_p^0 = N_n^0$. Let us suppose that at any time after the application
of a tensile stress along the [111] direction, $\Delta N$ pairs of solute atoms have
changed from the n class to the p class. If $N_p$ and $N_n$ then represent the
numbers of p and n pairs respectively,

$$N_p = N_p^0 + \Delta N,$$
$$N_n = N_n^0 - \Delta N.$$




We assume also that the total number of pairs remains constant so that
$N_p + N_n = N_p^0 + N_n^0$.

If the tensile stress is now removed $N_p$ will change back to its equilibrium
value of $N_p^0$. With all the above assumptions the rate of this change is
given by

$$\frac{dN_p}{dt} = \frac{d\,\Delta N}{dt} = -\frac{4}{11\tau}(N_p^0 + \Delta N) + \frac{4}{11\tau}(N_n^0 - \Delta N), \qquad (6)$$

where the first term on the right hand side represents the decrease in $N_p$
due to p pairs changing to n pairs and the second term the increase in $N_p$
due to n pairs changing to p pairs. Equation (6) reduces to the form

$$\frac{d\,\Delta N}{dt} = -\frac{8\,\Delta N}{11\tau}, \qquad (7)$$

and by comparison with equation (3) we see that the measured relaxation
time is related to the mean time of stay $\tau$ by the equation

$$\tau = \frac{8}{11}\tau_R. \qquad (8)$$

We have dealt so far with the relations for the changes towards a stress
free equilibrium, but the same result can be deduced for the opposite
case where the system is changing towards an equilibrium in the presence
of a tensile stress. In this case, the probability per second of an atomic
migration which reorients a solute atom pair from the n class to the p class
is greater than the probability of a migration which leads to a change
from the p to the n class. If the probability of any migration in the
absence of a stress is given as a function of temperature by

$$\frac{1}{\tau} = \frac{1}{\tau_0}\exp(-H/RT),$$

then the unequal probabilities $1/\tau_{np}$ and $1/\tau_{pn}$ for atomic migrations in the
presence of a tensile stress which lead to class changes of n to p and of
p to n respectively may be written

$$\frac{1}{\tau_{np}} = \frac{1}{\tau_0}\exp(-(H-U)/RT), \qquad \frac{1}{\tau_{pn}} = \frac{1}{\tau_0}\exp(-(H+U)/RT), \qquad (9)$$

where 2U is the change in energy arising from the change in class of a
solute atom pair in the presence of the tensile stress. At equilibrium
the ratio of the numbers of p and of n pairs is given by exp (2U/RT) from
which we deduce that $\Delta N_p^0$, the number of p pairs in excess of those
existing in the absence of a stress, $N_p^0$, is given by

$$\Delta N_p^0 = \frac{U N_p^0}{RT}, \qquad (10)$$

since U is small compared to RT.




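Equation (6), representing the rate at which the number of p pairs is
changing towards the equilibrium number, is now rewritten as

$$\frac{dN_p}{dt} = -\frac{d(\Delta N_p^0 - \Delta N)}{dt} = -\frac{4}{11\tau_{pn}}(N_p^0 + \Delta N) + \frac{4}{11\tau_{np}}(N_n^0 - \Delta N) = \frac{8}{11\tau}(\Delta N_p^0 - \Delta N)$$

by equations (9) and (10).
Since $(\Delta N_p^0 - \Delta N)$ is now the number of p pairs which must yet reorient
for equilibrium to be obtained, we obtain again by comparison with
equation (3)

$$\tau = \frac{8}{11}\tau_R.$$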
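
The approximation involved here (U small compared to RT) can be examined numerically. The sketch below balances the biased rate equation exactly and compares the result with equation (10) and with the unbiased rate $8/11\tau$. The reduced bias `U_OVER_RT` and the pair number `N0` are hypothetical values chosen only for illustration.

```python
import math


def biased_rates(tau, u_over_rt):
    """Equation (9): the jump frequencies 1/tau_np and 1/tau_pn for reduced bias U/RT."""
    return math.exp(u_over_rt) / tau, math.exp(-u_over_rt) / tau


def equilibrium_excess(n0, u_over_rt):
    """Delta N at which the biased rate equation balances (exact, before linearising)."""
    rate_np, rate_pn = biased_rates(1.0, u_over_rt)
    return n0 * (rate_np - rate_pn) / (rate_np + rate_pn)   # equals n0 * tanh(U/RT)


def relaxation_rate(tau, u_over_rt):
    """Coefficient of Delta N in the biased rate equation, i.e. 1/tau_R."""
    rate_np, rate_pn = biased_rates(tau, u_over_rt)
    return 4.0 / 11.0 * (rate_np + rate_pn)


U_OVER_RT = 0.01   # hypothetical small bias
N0 = 1.0e6         # hypothetical number of pairs in each class
excess_ratio = equilibrium_excess(N0, U_OVER_RT) / (N0 * U_OVER_RT)  # compare with eq. (10)
rate_ratio = relaxation_rate(1.0, U_OVER_RT) * 11.0 / 8.0            # compare with 8/(11 tau)
```

Both ratios differ from unity only by terms of second order in U/RT, which is the sense in which the stressed and stress free cases give the same relation between $\tau$ and $\tau_R$.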


(b) Tensile stress applied along [001] direction. 


Of the six solute atom pair axes, four, namely [011], [101], $[0\bar{1}1]$ and $[\bar{1}01]$,
make an angle of 45° with the stress axis and two, [110] and $[1\bar{1}0]$, are
normal to the stress axis. The four members of the first group are of the
p class and the two members of the second group are of the n class. Any
reorientation of a member of the n class changes it to the p class, but only
one half of the reorientations of p class pairs lead to a change to class n.
A calculation along lines similar to those employed in the previous case
gives

$$\tau = \frac{12}{11}\tau_R. \qquad (11)$$


(c) Tensile stress applied along [101] axis. 


This is slightly more complicated by there being now three classes of
pair axes. The [101] pairs are parallel to the stress axis (p class), the $[\bar{1}01]$
pairs are normal to the stress axis (n class) and the remaining four types of
pairs [011], [110], $[0\bar{1}1]$ and $[1\bar{1}0]$ make an angle of 60° with the stress axis
(s class). The analysis is simplified however by the fact that there are
limitations on the changes of class that are possible. When pairs of the n
and p class reorient, by diffusion of one of their members, they can only
change into the s class; when a pair of the s class reorients the probability
is one half that it remains in the s class, one quarter that it changes to the
p class and one quarter to the n class. The application of the tensile
stress along [101] will then lead to an increase in the number of p pairs, a
decrease in the number of n pairs and an increase or decrease in the number
of s pairs according to whether the energy difference between an s and n
pair is greater or less respectively than the energy difference between a p
and s pair. A calculation of the rate at which the number of s pairs
changes towards its equilibrium value, and comparison of the result with
equation (3) gives again

$$\tau = \frac{12}{11}\tau_R.$$
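
The three results of this section can be checked together by building the matrix of reorientation rates between the six pair axes and applying it to the class populations followed in the text. The sketch below is one possible way of organising that check and not the calculation used in the paper. It assumes, as in the text, that each atom of a pair jumps into a given neighbouring site with probability $1/11\tau$ per unit time. The deviation patterns passed to `decay_rate` are chosen to represent the p minus n difference for [111], the p and n populations for [001], and the s population for [101].

```python
from fractions import Fraction
from itertools import product

HALF = Fraction(1, 2)
NEIGHBOUR_VECTORS = [v for v in product((-HALF, 0, HALF), repeat=3)
                     if sum(1 for c in v if c == 0) == 1]


def canonical(axis):
    """Pick one sign for an axis so that a direction and its reverse count as one pair axis."""
    first = next(c for c in axis if c != 0)
    return tuple(axis) if first > 0 else tuple(-c for c in axis)


AXES = sorted({canonical(v) for v in NEIGHBOUR_VECTORS})   # the six <110> pair axes


def rate_matrix():
    """w[(j, i)]: rate, in units of 1/tau, at which pairs on axis i become pairs on axis j."""
    w = {(j, i): Fraction(0) for i in AXES for j in AXES}
    near_origin = set(NEIGHBOUR_VECTORS)
    for i in AXES:
        near_partner = {tuple(a + b for a, b in zip(i, v)) for v in NEIGHBOUR_VECTORS}
        for vacancy in near_origin & near_partner:
            # Either atom may jump into the shared vacancy, each with rate 1/(11 tau).
            for new_axis in (vacancy, tuple(a - b for a, b in zip(i, vacancy))):
                w[(canonical(new_axis), i)] += Fraction(1, 11)
                w[(i, i)] -= Fraction(1, 11)
    return w


def decay_rate(w, deviation):
    """Return lam with W.deviation = -lam * deviation; raise if it is not a single mode."""
    image = {j: sum(w[(j, i)] * deviation[i] for i in AXES) for j in AXES}
    ref = next(j for j in AXES if deviation[j] != 0)
    lam = -image[ref] / deviation[ref]
    if any(image[j] != -lam * deviation[j] for j in AXES):
        raise ValueError("deviation is not a single relaxation mode")
    return lam


def class_deviation(stress, weights):
    """Give each axis a weight according to |axis . stress|, i.e. its class for this stress."""
    return {ax: Fraction(weights[abs(sum(a * s for a, s in zip(ax, stress)))])
            for ax in AXES}


W = rate_matrix()
# Each result is tau / tau_R, since the decay rate is 1/tau_R in units of 1/tau.
ratio_111 = decay_rate(W, class_deviation((1, 1, 1), {1: 1, 0: -1}))
ratio_001 = decay_rate(W, class_deviation((0, 0, 1), {HALF: 1, 0: -2}))
ratio_101 = decay_rate(W, class_deviation((1, 0, 1), {HALF: 1, 1: -2, 0: -2}))
```

Working in exact fractions lets the comparison with 8/11 and 12/11 be made without rounding.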




§ 5. CALCULATION OF D AND COMPARISON WITH DIRECTLY MEASURED
VALUES.


We have calculated the relation between $\tau$ and $\tau_R$ for three representative
directions of stress. Since Ké used only a polycrystalline wire in his
experiments it is reasonable to suppose that the appropriate value of $\tau$
to use in the calculation would be some mean value of $8\tau_R/11$ and $12\tau_R/11$.
Zener used a single crystal bar but does not report its orientation so that
the best we can assume again is that the appropriate value of $\tau$ to use will
be near to the range $8\tau_R/11$ to $12\tau_R/11$. We shall then calculate from the
data given by Ké and by Zener the values of D corresponding to these two
values of $\tau$, for which the relations are, from equations (4), (5), (8) and (11),

$$D = \frac{22\pi\nu a^2}{96}\ \text{cm.}^2\,\text{sec.}^{-1} \quad \text{for } \tau = 8\tau_R/11,$$

$$D = \frac{22\pi\nu a^2}{144}\ \text{cm.}^2\,\text{sec.}^{-1} \quad \text{for } \tau = 12\tau_R/11.$$

$\nu$ is the frequency in cycles per second at which the anelastic measurements
were made and $a$ is the lattice constant. Both Ké and Zener used approximately
70/30 brass for which we take $a = 3.675$ Å. The appropriate data
and results are set out in Table II.


TABLE II.
Calculation of Diffusion Coefficient from Anelastic Measurements.

Temperature at which maximum
  occurs in $Q^{-1}$ ~ 1/T curve, °K.            689                     614                     588
1000/T                                        1.450                   1.628                   1.700
Frequency of measurement $\nu$, c.p.s.           620                     2.3                     0.6
D cm.² sec.⁻¹ for $\tau = 8\tau_R/11$              $6.04 \times 10^{-13}$      $2.24 \times 10^{-15}$      $5.84 \times 10^{-16}$
Log D                                         $\bar{13}.781$              $\bar{15}.350$              $\bar{16}.766$
D cm.² sec.⁻¹ for $\tau = 12\tau_R/11$             $4.02 \times 10^{-13}$      $1.49 \times 10^{-15}$      $3.89 \times 10^{-16}$
Log D                                         $\bar{13}.604$              $\bar{15}.173$              $\bar{16}.590$
Reference                                     Zener 1947              Ké 1948                 Ké 1948
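
The entries of Table II follow from equations (4), (5), (8) and (11), and can be recomputed as a check. The sketch below uses the lattice constant and the peak data quoted in the text; the function and variable names are illustrative only.

```python
import math

LATTICE_CONSTANT_CM = 3.675e-8   # a = 3.675 angstrom, as taken in the text


def diffusion_coefficient(frequency_cps, tau_over_tau_r, a_cm=LATTICE_CONSTANT_CM):
    """D = a^2 / (12 tau), with tau_R = 1 / (2 pi nu) from equation (4)."""
    tau_r = 1.0 / (2.0 * math.pi * frequency_cps)
    tau = tau_over_tau_r * tau_r
    return a_cm ** 2 / (12.0 * tau)


# (temperature of the maximum in K, frequency in c.p.s.), from Table II
PEAKS = [(689.0, 620.0), (614.0, 2.3), (588.0, 0.6)]

results = [(t,
            diffusion_coefficient(nu, 8.0 / 11.0),    # tau = 8 tau_R / 11
            diffusion_coefficient(nu, 12.0 / 11.0))   # tau = 12 tau_R / 11
           for t, nu in PEAKS]

for t, d_8, d_12 in results:
    print(f"T = {t:5.0f} K   D(8/11) = {d_8:.3g}   D(12/11) = {d_12:.3g} cm^2/sec")
```

The recomputed values agree with the tabulated ones to within rounding of the last figure.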


It remains to compare these results with the directly measured coefficients
of diffusion in α-brass. A large number of investigators have studied this
system and their work has recently been reviewed and assessed by
Kubaschewski (1950). The only useful results for our purpose are those of
Kirkendall (1942) and of Kirkendall, Thomassen and Upthegrove (1939),
for these only were obtained for well defined zinc concentrations, close to
those used by Ké and Zener. Their values of D are given in Table III,
but they cannot be compared with our calculated values of D until certain
corrections have been made. These are, in order of importance,

(a) A thermodynamic correction,

(b) A correction arising from the Kirkendall Effect in α-brass (Darken
1948),

(c) A concentration correction. 




(a) Thermodynamic correction 


In a conventional diffusion coefficient measurement the driving force
for the diffusion flow is derived not only from the concentration gradient,
in terms of which alone the diffusion coefficient is defined, but also, since
the solution is never ideal, from an internal energy gradient. The partial
molar internal energy of each species present varies with concentration so
that whatever the direction of the concentration gradient, there is also a
tendency for atoms to flow in the direction of their decreasing partial
molar internal energy. In extreme cases this latter tendency is
stronger than the tendency to equalize the concentrations (i.e. increase
entropy of mixing) and the phenomenon of "uphill" diffusion against
the concentration gradient occurs. The author has discussed this at
length elsewhere (Le Claire 1949) and the result is that the diffusion
coefficient $D_g$, measured in the presence of a concentration gradient, is
related to the true diffusion coefficient $D_a$, that would be obtained in the
absence of a chemical concentration gradient, say by the use of tracers or
of the anelastic method we are now discussing, by the relation

$$D_g = D_a\left(1 + \frac{\partial \log \gamma}{\partial \log c}\right), \qquad (11)$$

where $\gamma$ is the activity coefficient of the diffusing species and $c$ the
concentration at which $D_g$ was measured.

The activity data of Hargreaves (1949) as smoothed by Fisher, Holloman
and Turnbull (1948) have been used in computing the values, shown
in Table III, of the thermodynamic factor in equation (11).


TABLE III.

Directly Measured Diffusion Coefficients in α-brass with Corrections.

T °K.                                     873                     928                     993                     1053
1000/T                                    1.146                   1.078                   1.007                   0.949
c per cent Zn                             28.3                    26.8                    25.3                    26.0
'True' c per cent Zn                      29.9                    28.4                    26.9                    26.0
$D_g$ cm.² sec.⁻¹                          $8.6 \times 10^{-11}$       $3.54 \times 10^{-10}$      $1.31 \times 10^{-9}$       $3.8 \times 10^{-9}$
$(1 + \partial \log \gamma / \partial \log c)$      2.46                    2.33                    2.19                    2.00
$D_a$ cm.² sec.⁻¹                          $3.49 \times 10^{-11}$      $1.52 \times 10^{-10}$      $5.98 \times 10^{-10}$      $1.82 \times 10^{-9}$
m                                         −0.02                   +0.01                   +0.10                   +0.04
$D_c$ cm.² sec.⁻¹                          $3.99 \times 10^{-11}$      $1.76 \times 10^{-10}$      $6.99 \times 10^{-10}$      $2.14 \times 10^{-9}$
Log $D_c$                                  $\bar{11}.601$              $\bar{10}.245$              $\bar{10}.844$              $\bar{9}.330$


(b) Kirkendall Effect correction.

A second but smaller correction is necessary because the anelastic
measurements give the separate diffusion coefficient of zinc in α-brass
whereas it can be shown (Darken 1948, Le Claire 1949) that $D_a$ is composite
in the sense that

$$D_a = c_{\mathrm{Zn}} D_{\mathrm{Cu}} + c_{\mathrm{Cu}} D_{\mathrm{Zn}},$$

where $D_{\mathrm{Zn}}$ and $D_{\mathrm{Cu}}$ are the separate true diffusion coefficients of zinc and
copper and $c_{\mathrm{Zn}}$ and $c_{\mathrm{Cu}}$ are the fractional concentrations of zinc and
copper at which the measurement of $D_g$ was made. In view of the
Kirkendall effect in α-brass, $D_{\mathrm{Zn}}$ and $D_{\mathrm{Cu}}$ are not equal and from Darken's
calculations, based on the observed magnitude of the Kirkendall effect, we
find

$$\frac{D_{\mathrm{Zn}}}{D_a} = 1.15.$$

We shall assume that this relatively small correction is the same at the
higher concentrations ($c_{\mathrm{Zn}} \sim 29$) and lower temperatures (~300° C.)
employed in the anelastic measurements.


(c) Concentration correction. 


There is a third correction m for the difference in concentration between
the samples used by Ké and Zener (~29 per cent Zn) and those of Table
III. This difference is not quite as great as appears at first sight. The
content of zinc as given in the papers of Kirkendall and of Kirkendall et al.
was measured by determination of the lattice parameter, but the relations
between composition and lattice parameter used in the two papers do not
appear to be the same. Using the precision lattice parameter determinations
of Owen and Roberts (1939) to obtain the correct zinc concentrations
corresponding to the lattice constants reported, we find that the first three
values in Table III must be increased by approximately 1.6 to give the
true weight per cent of zinc, but that the fourth is correct. The corrected
values are given in the fourth row of Table III.

Using the data of Seith and Kraus (1938) for the variation of $D_a$ with
concentration, we find that $D_a$ as calculated from $D_g$ increases by about
4 per cent for a rise of 5 per cent in the concentration of zinc in the range
25-30 per cent zinc. This correction, though very small, has been applied
to the data in Table III in order to convert all values of $D_a$ to the values
corresponding to the concentration of 29 per cent zinc used by Ké. But if
we use the data of Rhines and Mehl (1938) the correction is very much
larger and we find that a 2 per cent increase in zinc concentration leads to a
30 per cent increase in $D_g$. The data of Seith and Kraus are preferred
because only they measured $D_g$ as a mean value over a relatively small
range of concentration. Also, we should expect their accuracy to be the
same for all concentrations, whereas the Matano method of analysis used by
Rhines and Mehl falls off in accuracy towards the limits of the much wider
concentration ranges (0-30 per cent zinc) that they used. This is clearly
seen by the way in which their $D_g$ increases rapidly towards the high zinc
end of the concentration range used, irrespective of the actual concentration
of zinc at this end of the concentration range. (Compare samples
30 and 32 of their fig. 9.) The much more rapid increase in $D_g$ with
increasing and high zinc concentration found by Rhines and Mehl is then
very suspect.




Our values of D calculated from the anelastic measurements are then to
be compared with the corrected values $D_c$ of directly measured diffusion
coefficients, given by

$$D_c = \left[\frac{D_g}{1 + \dfrac{\partial \log \gamma}{\partial \log c}} + m\right] \times 1.15.$$

Fig. 4.

[Figure: $\log_{10} D$ plotted against 1000/T, with a temperature scale in °C. Corrected values of directly measured diffusion coefficients: Kirkendall (1942); Kirkendall, Thomassen and Upthegrove (1939); activation energy Q = 40,600 cals./mole. Diffusion coefficients calculated from anelastic measurements: tensile stress along [111]; tensile stress along [101] or [001].]

Comparison of directly measured and calculated diffusion coefficients.


These’ values, with their logarithms, are given in rows 9 and 10 of 
Table IIT. On the assumption, which was justified in the analogous case of 
interstitial diffusion (Wert 1950 a), that the well known‘ relation between 
D and T 

‘D=D, exp (—H/RT) 
holds true with constant H and D, down to the temperatures at which the 
anelastic measurements were made, log D, is plotted against 1/T in fig. 4, 
and the straight line through the points extrapolated down to these temper- 


atures. The values of log D calculated from the anelastic measurements 


686 A. D. Le Claire on the Anelastic Measurements of 


are plotted on the same figure, and show an excellent agreement between 
the measured D, and the values of D calculated from Ké’s results. By 
measurement of the displacement of the two curves in fig. 2, Ké calculated 
directly from his results an activation energy for the relaxation process of 
about 40,000 cals./gm.mole which agrees excellently with the activation 
energy of 40,600 cals./gm. mole calculated for the slope of the line in fig. 4. 
This agreement may also, of course, be appreciated from fig. 4 alone. 
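The way an activation energy follows from the slope of a plot of log D against 1/T can be sketched as below. This is a minimal illustration, not the author's procedure: the temperatures and the prefactor `d0 = 0.1` are hypothetical, the data are synthetic points generated from the Arrhenius relation with H = 40,600 cals/mole, and the gas constant is taken as 1.987 cal/(mole K).

```python
import math

R_CAL = 1.987  # gas constant in cal/(mole K); assumed value

def arrhenius_d(d0, h_cal, temp_k):
    """D = D0 * exp(-H / (R T))."""
    return d0 * math.exp(-h_cal / (R_CAL * temp_k))

def activation_energy(temps_k, d_values):
    """Least-squares slope of ln D against 1/T, returned as H in cals/mole."""
    xs = [1.0 / t for t in temps_k]
    ys = [math.log(d) for d in d_values]
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    num = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    den = sum((x - x_mean) ** 2 for x in xs)
    return -(num / den) * R_CAL

# Synthetic points; d0 and the temperatures are illustrative only.
temps = [573.0, 673.0, 873.0, 1073.0]
ds = [arrhenius_d(0.1, 40600.0, t) for t in temps]
h_fit = activation_energy(temps, ds)
```

With exact Arrhenius data the fitted H reproduces the input value; with measured points the scatter about the line sets the uncertainty in H.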

The diffusion coefficient calculated from Zener’s result shows less good 
agreement with the directly measured values, and it is difficult to see how 
such a discrepancy could arise. No likely error in the measurement of v 
could account for it, and it is hardly possible that a direction for the stress 
axis could be found for which the relation between 7 and 7p would differ so 
much from those already found as to give a value for D very much closer 
to the line of fig. 4. However, if we suppose there to have been an error 
in temperature measurement of 22° C., the value of D calculated for the 
case=127,/11 would lie on the straight line in fig. 4. Zener (1943) 
does mention that for the purposes of this particular experiment the 
internal friction apparatus was heated "as hot as it was thought
capable of standing" and that "the apparatus, in fact, was damaged by
the experiment". It is then conceivable that under such severe conditions
of usage an error in temperature measurement of about 20° C. (at 420° C.)
may have occurred. There is the further point against accepting Zener’s 
result as being as reliable as Ké’s, that the activation energy of 33,600 
cals./gm.mole which Zener calculated from the slope of the curve in fig. 1 
is much less than the values of Ké and of fig. 4. Ké’s measurements were 
in fact made primarily to give a more accurate value of this activation 
energy. 
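The sensitivity of D to a temperature error can be estimated from the Arrhenius relation alone. The sketch below assumes H = 40,600 cals/mole (the value from fig. 4), a nominal temperature of 420° C. and an error of 22° C.; the function name and the gas constant value are illustrative choices.

```python
import math

R_CAL = 1.987  # gas constant in cal/(mole K); assumed value

def d_ratio(h_cal, temp_c, error_c):
    """Ratio D(T + error) / D(T) for an Arrhenius law with activation energy H."""
    t_low = temp_c + 273.15
    t_high = t_low + error_c
    return math.exp((h_cal / R_CAL) * (1.0 / t_low - 1.0 / t_high))

# Factor by which D changes if the true temperature was 22 deg C above 420 deg C.
ratio = d_ratio(40600.0, 420.0, 22.0)
```

Under these assumptions the ratio comes out at roughly 2.5, which indicates the size of displacement in D that a temperature error of this order could produce.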


§ 6. Discussion. 


The good agreement obtained between the directly measured and properly
corrected diffusion coefficients, and those calculated from anelastic
measurements, both confirms Zener’s original identification of a relaxation
peak as due to the mechanism of preferential orientation of solute atom
pairs, and warrants the use of this method in the study of diffusion in
substitutional solid solutions of metals. There are many advantages to be
gained from such a use. Measurements can be made at much lower 
temperatures than are necessary for the conventional methods and in a 
very much shorter time, for we are here concerned with diffusional motion 
over a distance of only one atomic spacing. This means that D can be 
studied over a much wider range of temperature than previously so that a 
more precise knowledge of activation energies and of their possible depen- 
dence on temperature becomes possible. Also for the first time we have 
the possibility of studying the dependence of D on such variables as cold 
work, and an excess number of vacancies introduced by quenching from 
high temperatures, both of which are greatly changed during the long times 
required for conventional measurements of D. Studies of the variation
of D over considerable ranges of concentration could also be made in a
very much shorter time than by the old methods and would have the added 
advantage that D would be measured at a fixed concentration rather than 
over the range of concentrations that is necessary in the conventional 
methods. In view of the great need in diffusion theory for accurate 
experimental data with which to compare the theoretical predictions, the 
advantages of the anelastic method are that it is rapid, that the 
concentration is well defined and also that it provides the theoretically 
more significant true or ideal diffusion coefficient (Le Claire 1949) ; 
thermodynamic data from which to calculate the true from the measured 
diffusion coefficient are then not required. 

The anelastic method has also of course certain limitations. The solute 
and solvent atoms must differ appreciably in size and the concentration 
must not be so low that the number of solute atom pairs is inappreciable.
For too low a concentration and too small a difference in size, the relaxation
strength, which measures the height of the peak in the curve of $Q^{-1}$
against 1/T, may become too small for any peak to be observed. The
extent of these limitations is probably best determined by experiment.
However, as we might expect, and as Zener (1947) shows, $\Delta_E$, which is
twice the maximum value of $Q^{-1}$ (see equation (2)), is proportional to the
total number of solute atom pairs. If this number were proportional
to the actual solute concentration we might expect to be able to observe a 
peak for a concentration as low as say 3 per cent zinc, for then the curve in 
fig. 1 would have an observable maximum as about Q-1=120-10-°. The 
number of solute atom pairs will, however, fall more rapidly than the 
concentration, so that 3 per cent will probably be below the lowest con- 
centration at which anelastic measurements of D can be made on «-brass. 
Zener (1947) has shown that the relaxation strength is roughly proportional 
to the square of the rate of change of lattice parameter with composition 
so that the larger the difference in size between solute and solvent atoms, 
the lower will be the concentration at which a measurable anelastic peak 
will be obtained. Zener also shows that $\Delta_E$, and therefore also the minimum
concentration at which an anelastic peak can be observed, will vary
with the direction of the applied stress, but that it never becomes zero, as 
it does in the case of body-centred cubic solid solutions. 

Another limitation is that the method only measures the diffusion 
coefficient of one of the atomic species present, although this may not 
always be a disadvantage. 

Further attention to some of the assumptions made in the development 
of a relation between 7 and Tp might be necessary. We have assumed that 
the total number of solute atom pairs is always constant and that the 
number of pairs in a particular class only changes as the result of a 
reorientation process ; a pair which breaks up is always replaced by another
of the same class as a result of lone solute atoms diffusing towards one 
another to form a pair. However, it is unlikely that attention to such 
refinements would change appreciably the values of the diffusion coefficients
obtained above. More important may be the possible cooperative
effects arising from interactions between pairs of solute atoms and altering 
the rate of change towards equilibrium. This could be studied, as was 
the interaction between interstitial solute atoms in tantalum (Ké 1948 b), 
by measurements of $\Delta_E$ over a considerable range of temperature and the
plotting of $1/\Delta_E$ against T. Zener showed (Zener 1948) that $\Delta_E$ varies
with the absolute temperature T in the manner

$$\Delta_E = \frac{T_0}{T - \beta T_0},$$

where $T_0$ is a temperature independent factor and $\beta$ a numerical coefficient
which takes account of solute atom interaction. The intersection with the
T axis of a plot of $1/\Delta_E$ against T will then give a measure of $\beta$.
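The proposed plot can be sketched numerically. Assuming the relation has the form $\Delta_E = T_0/(T - \beta T_0)$, the reciprocal $1/\Delta_E = T/T_0 - \beta$ is linear in T, with slope $1/T_0$ and a crossing of the T axis at $T = \beta T_0$. The numbers below (`T0 = 2.0`, `beta = 50.0`, the temperature grid) are hypothetical and serve only to show the fit.

```python
def fit_line(xs, ys):
    """Ordinary least-squares straight line; returns (slope, intercept)."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    num = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    den = sum((x - x_mean) ** 2 for x in xs)
    slope = num / den
    return slope, y_mean - slope * x_mean

def interaction_coefficient(temps, strengths):
    """Recover (T0, beta) from relaxation strengths measured at several temperatures."""
    inverse = [1.0 / s for s in strengths]
    slope, intercept = fit_line(temps, inverse)
    t0 = 1.0 / slope
    t_axis_crossing = -intercept / slope  # equals beta * T0
    return t0, t_axis_crossing / t0

# Synthetic relaxation strengths generated from the assumed relation.
temps = [300.0, 350.0, 400.0, 450.0, 500.0, 550.0, 600.0]
strengths = [2.0 / (t - 50.0 * 2.0) for t in temps]
t0_fit, beta_fit = interaction_coefficient(temps, strengths)
```

A value of $\beta$ indistinguishable from zero in such a fit would indicate that interaction between pairs is negligible over the range studied.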

Finally, it will be necessary to study the relation between 7 and 7x for 
any arbitrary direction of stress. This is important because it is not 
always easy to grow large single crystals with precisely the required 
orientation and because from it we can derive an accurate mean relation 
applicable to polycrystalline material. 

It is hoped shortly to publish a further paper dealing with the extensions 
and refinements of the theory. 




ACKNOWLEDGMENTS 


The author wishes to thank Dr. H. M. Finniston, Messrs. B. G. Childs
and R. S. Barnes for useful discussions and is indebted to Sir John
Cockcroft, Director, A.E.R.E., for permission to publish this paper.


REFERENCES. 


CHANDRASEKHAR, S., 1943, Rev. Mod. Phys., 15, 1.

DARKEN, L. S., 1948, Trans. Amer. Inst. Min. Met. Engrs., 175, 184.

FISHER, J. C., HOLLOMON, J. H., and TURNBULL, D., 1948, Trans. Amer. Inst.
Min. Met. Engrs., 175, 202.

HARGREAVES, R., 1939, J. Inst. Metals, 64, 115.

KÊ T’ING-SUI, 1948 a, J. App. Phys., 19 (3), 285 ; 1948 b, Phys. Rev., 74 (1), 16.

KIRKENDALL, E. O., THOMASSEN, L., and UPTHEGROVE, C., 1939, Trans. Amer.
Inst. Min. Met. Engrs., 183, 186.

KIRKENDALL, E. O., 1942, Trans. Amer. Inst. Min. Met. Engrs., 147, 104.

KUBASCHEWSKI, O., 1950, Trans. Far. Soc., 46, 713.

LE CLAIRE, A. D., 1949, Progress in Metal Physics, 1 (Ed. B. Chalmers ;
London : Butterworths), Chap. 7.

OWEN, E. A., and ROBERTS, E. W., 1939, Phil. Mag., 27, 294.

RHINES, F. N., and MEHL, R. F., 1938, Trans. Amer. Inst. Min. Met. Engrs.,
…, 185.

SEITH, W., and KRAUS, W., 1938, Zeit. f. Electrochem., 44, 98.

WERT, C. A., and ZENER, C., 1949, Phys. Rev., 76 (8), 1169.

WERT, C. A., 1950 a, Phys. Rev., 79 (4), 601 ; 1950 b, J. App. Physics, 21 (11), 1196.

ZENER, C., 1943, Trans. Amer. Inst. Min. Met. Engrs., 152, 122 ; 1947, Phys.
Rev., 71 (1), 34 ; 1948, Elasticity and Anelasticity of Metals (Chicago :
University Press).




LXXI. The Distribution of Energy in Randomly Modulated Waves *. 


By DAVID MIDDLETON,
Cruft Laboratory, Harvard University, U.S.A.†.


[Received March 30, 1951.] 


ABSTRACT. 


The distribution of energy in high-frequency oscillations which are 
modulated by stationary, random disturbances such as shot and thermal 
noise is here examined in a number of important practical situations. 
In (a) the theory for frequency- and phase-modulation by normal random
noise is developed, in particular as a model for speech-modulated carriers.
The effect of spectral shape of the modulating noise on the intensity 
distribution of the modulated carrier is considered, and it is found that 
the lowest-frequency components of the modulation are most significant 
in this respect. For example, in frequency-modulation the carrier term 
is entirely absent when the modulating noise has a finite spectral density 
at and near zero-frequency, while in phase-modulation by the same 
disturbance the carrier component may contain a noticeable fraction of 
the modulated wave’s energy. Particular attention is also given to the 
limiting cases of very slow frequency deviation, and it is found that such 
adiabatic sweeps spread the original spectrum. Part (b) contains a 
parallel discussion for the amplitude-modulation by normal random
noise, including the effects of possible over-modulation. 


1. INTRODUCTION. 


BECAUSE noise is the unfailing and undesirable companion of intelligence 
transmitted or received by electronic and acoustical systems, it is essential 
for any proper theory of communication to provide suitable methods for 
studying the physical properties of a noise wave and its interaction with 
a desired signal. On the one hand a successful technique of measurement 
is required to control or minimize the noise, and on the other an adequate 
theory is necessary to guide experiment and interpret the data. 
Accordingly, the purpose of this paper is to present a number of new 
results, obtained by the analytical methods developed in recent years 
(Wiener 1926, 1930, Khintchine 1934, Cramér 1940, Chandrasekhar 1943, 
Bennett 1944, Rice 1944, 1945, Blanc-Lapierre 1945, Wang and Uhlenbeck 
1945, Boonimovich 1946, Middleton 1946, 1948 a,b, 1949, van der Pol 


* The research reported in this document was made possible through support
extended Cruft Laboratory, Harvard University, jointly by the Navy Depart- 
ment (Office of Naval Research), the Signal Corps of the U.S. Army, and the 
U.S. Air Force, under ONR Contract N5ori-76, T.O.1. 

+ Communicated by Sir Edward Appleton, F.R.S. 




1946, Van Vleck and Middleton 1946) for the following important 
problems*. (In all the cases considered here the noise is assumed to 
belong to the fluctuation type, characteristic of shot and thermal noise 
which belongs to a normal random process (Wang and Uhlenbeck 1945, 
Middleton 1948 a, 1949). Impulsive noise, such as atmospheric and solar 
static, is not treated, although the general methods are the same.) Our
treatment in the present paper is limited to a discussion of noise only ; 
analysis of the case of signal and noise is considered elsewhere. We 
discuss specifically :— 

(i.) Carrier phase- or frequency-modulated by noise: this problem is of 
particular interest when normal random noise is used as an approximate 
model of speech†.

(ii.) Carrier amplitude-modulated by noise: this situation is similar 
to (i.), except that now amplitude- rather than frequency-distortion of 
the carrier wave is the mechanism producing the modulation. The 
important case of over-modulation is discussed. 


We remark further that, apart from the specific applications to 
problems (i.) and (ii.), the results are needed in the general theory of 
noise measurement, for here the central objective is to be able to determine 
by measurements on the output wave following various linear and non- 
linear operations (such as amplification, rectification, clipping, mixing, 
modulation, discrimination, etc.), the ‘‘ structure ” of the original input 
disturbance, 7.e. whether or not it is an amplitude- or frequency- 
modulated wave, how the noise and carrier occur together, and other 
qualitative and quantitative data. 

The quantities of chief physical interest are (a), the mean or steady 
component of the disturbance, (b), the mean intensity of the wave, and 
(c), the spectral distribution W(f) of the mean intensity. The latter 
quantity is in fact sufficient to give us the other two ; the mean intensity 
is obtained as the area under the spectral density curve W(f), while the 


* The details of the analysis, including a discussion of the case of signal and
noise, are given in a series of papers, "On the Distribution of Energy in Noise-
and Signal-Modulated Waves" (D. Middleton), scheduled to appear in the
Quarterly J. of Applied Mathematics (U.S.A.). For the complete and detailed
discussion see D. Middleton, On the Distribution of Energy in Noise- and
Signal-Modulated Waves, Cruft Laboratory Technical Report No. 99, March 1,
1950.

† Recent experiments of W. B. Davenport, Jr. (A Study of Speech Probability
Distributions, M.I.T. Report No. 148, August 25, 1950) indicate that speech is
more satisfactorily described statistically in terms of an impulsive or "static"
noise model, where overlapping among individual (and independent) pulses 
is assumed to be small, of the order of 30-50 per cent of the time. This is 
different from the usual model of fluctuation or normal random noise, which 
assumes complete and highly multiple overlapping between the elementary 
transients. However, such normal random noise has the great advantage of 
mathematical simplicity, and its use as a speech model is therefore justified on 
these and physical grounds, at least as a first approximation. 




(square of the) steady component is given by the constant (or frequency- 
independent) term in the expression for the spectrum. It is assumed 
that we are dealing with a stationary (and ergodic) random process, 
namely, a process for which the underlying mechanism does not change 
with time. Then time averages and ensemble or statistical averages 
are equivalent (James, Nichols and Phillips 1946) to within a set of
random functions of probability zero, so that if we represent our stochastic,
time-dependent disturbance by y(t) we may write the steady component (a)
as *

$$\langle y(t_0)\rangle_{\rm av} = \lim_{T\to\infty}\frac{1}{T}\int_0^T y(t_0)\,dt_0 = \overline{y(t_0)} = \int_{-\infty}^{\infty} y\,W_1(y)\,dy, \tag{1.1}$$

and the mean intensity (b)

$$\langle y(t_0)^2\rangle_{\rm av} = \lim_{T\to\infty}\frac{1}{T}\int_0^T y(t_0)^2\,dt_0 = \overline{y(t_0)^2} = \int_{-\infty}^{\infty} y^2\,W_1(y)\,dy. \tag{1.2}$$

$W_1(y)\,dy$ is the probability that (at any initial time $t_0$) y lies in the range
(y, y+dy). The moment of greatest interest, however, is given by

$$R(t) \equiv \langle y(t_0)\,y(t_0+t)\rangle_{\rm av} = \lim_{T\to\infty}\frac{1}{T}\int_0^T y(t_0)\,y(t_0+t)\,dt_0. \tag{1.3}$$


The quantity R(t) is the auto-correlation function of y and may be found
statistically when the second-order probability density $W_2(y_1, y_2; t)$ is
known ; $W_2$ has the following interpretation :

$W_2(y_1, y_2; t)\,dy_1\,dy_2$ = the joint probability that at some (initial)
time $t_0$, y (= $y_1$) lies in the range ($y_1$, $y_1+dy_1$)
and at a later time $t_0+t$, y (= $y_2$) falls in the
interval ($y_2$, $y_2+dy_2$). (1.4)

Because time and ensemble averages are equivalent here, equation (1.3)
becomes

$$R(t) = \overline{y_1 y_2} = \int_{-\infty}^{\infty}\!\!\int_{-\infty}^{\infty} y_1 y_2\,W_2(y_1, y_2; t)\,dy_1\,dy_2, \tag{1.5}$$


and since the process is stationary, the initial times $t_0$ do not enter : one
is concerned only with the time intervals (t) between observations.
Knowledge of the correlation function R(t) is important, for by the
theorem of Wiener (1930) and Khintchine (1934) the mean intensity
spectrum follows at once as the cosine Fourier transform of R(t), namely

$$W(f) = 4\int_0^\infty R(t)\cos\omega t\,dt, \qquad \omega = 2\pi f, \tag{1.6 a}$$

with the inverse relation

$$R(t) = \int_0^\infty W(f)\cos\omega t\,df. \tag{1.6 b}$$


* We distinguish here between time and statistical averages by $\langle\ \rangle_{\rm av}$ and
the superscript bar, respectively. The equivalence of these averages
follows from the ergodic theorem (Chandrasekhar 1943, James, Nichols and
Phillips 1946).




To determine the desired energy spectrum W(f) the simplest procedure
is first to obtain the correlation function and then apply (1.6 a). Note
from (1.6 b) that setting t=0 in R(t) gives the mean total intensity of the
random wave, namely

$$R(0) = \int_0^\infty W(f)\,df = \int_{-\infty}^{\infty}\!\!\int_{-\infty}^{\infty} y_1 y_2\,W_2(y_1, y_2; 0)\,dy_1\,dy_2 = \overline{y^2}, \tag{1.7}$$

which is the area under the spectral distribution curve W(f), as expected.
On the other hand, allowing t to become infinite in R(t) yields the steady
component $\bar y$, since $\lim_{t\to\infty} W_2(y_1, y_2; t) = W_1(y_1)W_1(y_2)$, so that (1.5)
becomes

$$\lim_{t\to\infty} R(t) = \int_{-\infty}^{\infty}\!\!\int_{-\infty}^{\infty} y_1 y_2\,W_1(y_1)W_1(y_2)\,dy_1\,dy_2 = \bar y_1\cdot\bar y_2 = \bar y^{\,2}, \tag{1.8}$$

from (1.1). For a pure noise wave $\bar y$ vanishes, as there are no steady
components. However, when y does not represent a purely stochastic
variable, but contains steady and periodic terms as well, $\lim_{t\to\infty} R(t)$ will not
die down in time, but will oscillate indefinitely. If R(t) is then expanded
in a Fourier series, the coefficient of each periodic component represents
the mean power (or energy, as the case may be) associated with that
component ; setting t=0 in R(t) still gives the mean total power in the
wave.
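Relations (1.6 a) and (1.7) can be checked numerically on a simple case. The sketch below uses the correlation function R(t) = exp(−a|t|), an illustrative choice that is not one of the cases treated in the paper, for which the cosine transform is known in closed form, W(f) = 4a/(a² + ω²). The truncation limits and step counts are arbitrary.

```python
import math

def trapezoid(func, lower, upper, steps):
    """Composite trapezoidal rule on [lower, upper]."""
    h = (upper - lower) / steps
    total = 0.5 * (func(lower) + func(upper))
    for k in range(1, steps):
        total += func(lower + k * h)
    return total * h

def correlation(t, a=1.0):
    """Illustrative correlation function R(t) = exp(-a|t|)."""
    return math.exp(-a * abs(t))

def spectrum_numeric(f, t_max=60.0, steps=60000):
    """W(f) = 4 * integral_0^inf R(t) cos(2 pi f t) dt, truncated at t_max."""
    omega = 2.0 * math.pi * f
    return 4.0 * trapezoid(lambda t: correlation(t) * math.cos(omega * t),
                           0.0, t_max, steps)

def spectrum_exact(f, a=1.0):
    """Closed form of the same transform for the exponential correlation."""
    omega = 2.0 * math.pi * f
    return 4.0 * a / (a * a + omega * omega)

w_num = spectrum_numeric(0.3)
w_ref = spectrum_exact(0.3)
# The area under W(f) should return R(0) = 1, the mean total intensity.
area = trapezoid(spectrum_exact, 0.0, 2000.0, 400000)
```

The small shortfall of `area` from unity comes from cutting the frequency integral off at a finite upper limit.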

In a similar way we may find the correlation function for a general 
function g(y) of the random variable y. By definition (Rice 1944, 1945,
Middleton 1948 a, 1949) we have

$$R(t) \equiv \langle g[y(t_0)]\,g[y(t_0+t)]\rangle_{\rm av} = \overline{g(y_1)\,g(y_2)} = \int_{-\infty}^{\infty}\!\!\int_{-\infty}^{\infty} g(y_1)\,g(y_2)\,W_2(y_1, y_2; t)\,dy_1\,dy_2. \tag{1.9}$$

The spectrum follows from (1.6 a). For the problems (i.)-(iii.) of the
present paper the modulated wave V(t) is expressed as a function of a 
statistical variable y, and the choice of g(y) is based on the pertinent 
physical model which describes the problem. In general, g(y) is not a 
linear function of y, and so the evaluation of the auto-correlation function 
becomes difficult. 
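A direct numerical evaluation of (1.9) is possible when $W_2$ is known. The sketch below assumes a zero-mean normal process with variance `psi` and correlation coefficient `rho` at the lag of interest, and takes g(y) = y² as an illustrative non-linear function (a hypothetical choice, not one of the paper's problems). For that case the double integral has the known value ψ²(1 + 2ρ²), which serves as a check.

```python
import math

def w2_normal(y1, y2, psi, rho):
    """Second-order normal density for zero-mean variables of variance psi."""
    norm = 2.0 * math.pi * psi * math.sqrt(1.0 - rho * rho)
    quad = (y1 * y1 + y2 * y2 - 2.0 * rho * y1 * y2) / (2.0 * psi * (1.0 - rho * rho))
    return math.exp(-quad) / norm

def correlation_of(g, psi, rho, half_width=8.0, n=400):
    """Midpoint-rule estimate of the double integral of g(y1) g(y2) W2."""
    limit = half_width * math.sqrt(psi)
    h = 2.0 * limit / n
    total = 0.0
    for i in range(n):
        y1 = -limit + (i + 0.5) * h
        g1 = g(y1)
        for j in range(n):
            y2 = -limit + (j + 0.5) * h
            total += g1 * g(y2) * w2_normal(y1, y2, psi, rho)
    return total * h * h

def square(y):
    return y * y

r_numeric = correlation_of(square, psi=1.0, rho=0.6)
r_closed = 1.0 * (1.0 + 2.0 * 0.6 ** 2)
```

For less tractable g(y) the same quadrature applies, which is the practical route when no closed form exists.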


2. RESULTS AND CONCLUSIONS. 
We consider first 


(a) Frequency- or Phase-Modulation of a Carrier by Noise. 


We start here with the following narrow-band wave : 


$$g(y) = V(t) = A_0 \exp\{i[\omega_0 t + \Psi(t)]\}, \qquad (y = \Psi(t)). \tag{2.1}$$

In (2.1) $A_0$ is the peak amplitude of the modulated carrier $\exp[i\omega_0 t + i\Psi(t)]$,
$\omega_0$ ($= 2\pi f_0$) is the central or resonant (angular) frequency of the intermediate
(IF) or radio frequency (RF) band of the apparatus in which V(t)
is generated*, and $\Psi(t)$ is a varying phase, which has the form

$$\Psi(t) = D(t), \quad\text{or}\quad \Psi(t) = \int^{t} D(t')\,dt', \tag{2.2}$$

respectively, for phase- or frequency-modulation. The amplitude $A_0$ and
modulation $\Psi$ are real quantities, so that the instantaneous voltage or
current wave is given either by the real or imaginary part of the complex
disturbance V(t). For a moment we leave open the precise nature of $\Psi(t)$,
and hence of D(t), except to note that $\Psi(t)_{\max}$ must be much less than
$\omega_0 t$, and $\dot\Psi(t)_{\max} = |d\Psi/dt|_{\max}$ considerably smaller than $\omega_0$ for $\Psi(t)$ to
represent a true phase- or frequency-modulation (van der Pol 1946).

By a slight modification of (1.3) to account for the complex character 
of V(t), the auto-correlation function R(t) is here 


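$$R(t) = \tfrac{1}{2}\,\mathrm{Re}\,\langle V(t_0)\,V(t_0+t)^*\rangle_{\rm av} = \frac{A_0^2}{2}\,\mathrm{Re}\,\{\exp(-i\omega_0 t)\,\langle \exp[i\Psi(t_0) - i\Psi(t_0+t)]\rangle_{\rm stat.\,av.}\}. \tag{2.3}$$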


For the present discussion D(t), and therefore $\Psi(t)$, belong to a normal
random process, since the modulation D(t) is fluctuation noise (Rice 1944,
1945, Wang and Uhlenbeck 1945, Middleton 1946, 1948 a, b, 1949), and
may be written explicitly as

$$D(t) = D_0 V_N(t), \quad\text{with } D_0 = \text{r.m.s. phase (or angular frequency) deviation / r.m.s. modulating voltage}, \tag{2.4 a}$$

since one has $\overline{D(t)^2} = D_0^2\,\overline{V_N(t)^2}$. Consequently, for frequency-modulation
and phase-modulation an r.m.s. (angular) deviation and an r.m.s. phase
deviation can be defined respectively by

$$\omega_d^2 \equiv \overline{D(t)^2} = D_0^2\,\overline{V_N(t)^2} \quad\text{and}\quad \theta_d^2 = D_0^2\,\overline{V_N(t)^2},$$

where, in the former, $D_0$ is measured as frequency/volt, while in the
latter instance $D_0$ has the units (volt$^{-1}$), so that $\theta_d^2$ is a pure number.
Expressing the auto-correlation function (2.3) now in terms of the
statistical average, we use the joint second-order probability density
[cf. (1.4)] for the pair of normal random variables $\Psi_1 = \Psi(t_0)$ and
$\Psi_2 = \Psi(t_0+t)$, which is well known to be (Rice 1944, 1945, Middleton
1948 a, 1949)

$$W_2(\Psi_1, \Psi_2; t) = \big(2\pi\,\overline{\Psi^2}\,\sqrt{1-\rho^2}\big)^{-1} \exp\big[-(\Psi_1^2 + \Psi_2^2 - 2\rho(t)\Psi_1\Psi_2)/2\overline{\Psi^2}(1-\rho^2)\big], \tag{2.4 b}$$
* It is assumed that any local oscillator noise, generated in the process of
mixing the modulation and carrier, or background noise, accompanying the
carrier or signal, is heavily suppressed relative to the principal modulation
product V(t), (2.1).


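where $\overline{\Psi^2} = \overline{\Psi_1^2} = \overline{\Psi_2^2}$, and $\rho = \rho(t) = \overline{\Psi_1\Psi_2}/\overline{\Psi^2}$. The correlation function
(2.3) is finally

$$R(t) = \frac{A_0^2}{2}\,\mathrm{Re}\,\{\exp(-i\omega_0 t - \overline{\Psi^2} + \overline{\Psi_1\Psi_2})\} = \frac{A_0^2}{2}\exp[-D_0^2\,\Omega(t)]\cos\omega_0 t, \tag{2.5}$$

with $\Omega(t) = (\overline{\Psi^2} - \overline{\Psi_1\Psi_2})/D_0^2$.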
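The step from (2.3) to (2.5) rests on the average of exp[i(Ψ₁ − Ψ₂)] over a zero-mean normal pair, which equals exp[−ψ(1 − ρ)], writing ψ for the mean square of Ψ and ρ for the correlation coefficient. The following Monte Carlo sketch checks that identity; the sample size, seed and parameter values are arbitrary illustrative choices.

```python
import math
import random

def phase_factor_mc(psi, rho, samples=200000, seed=1):
    """Sample average of cos(P1 - P2) for a zero-mean normal pair.

    The imaginary part of exp[i(P1 - P2)] averages to zero by symmetry,
    so only the cosine is accumulated.
    """
    rng = random.Random(seed)
    sigma = math.sqrt(psi)
    mix = math.sqrt(1.0 - rho * rho)
    total = 0.0
    for _ in range(samples):
        p1 = rng.gauss(0.0, sigma)
        p2 = rho * p1 + mix * rng.gauss(0.0, sigma)  # correlation rho with p1
        total += math.cos(p1 - p2)
    return total / samples

def phase_factor_exact(psi, rho):
    """exp[-(mean square - covariance)], i.e. exp[-D0^2 * Omega(t)]."""
    return math.exp(-psi * (1.0 - rho))

mc_value = phase_factor_mc(1.5, 0.4)
exact_value = phase_factor_exact(1.5, 0.4)
```

Multiplying this factor by (A₀²/2) cos ω₀t gives the correlation function of the modulated carrier at the lag to which ρ refers.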
For phase modulation by noise $\Omega(t)$ is found, on applying familiar
Fourier integral methods (Madelung 1936), to be specifically

$$\Omega_{PM}(t) = \int_0^\infty W_{PM}(f)\,(1 - \cos\omega t)\,df. \tag{2.6 a}$$

The spectral intensity $W_{PM}(f)$ of the phase-modulating noise $V_N(t)$ may
be obtained directly from (1.3) and (1.6 a) on application of the Wiener-
Khintchine theorem. In a similar manner we find for the more common
case of frequency-modulation, equation (2.2), that now

$$\Omega_{FM}(t) = \int_0^\infty W_{FM}(f)\,(1 - \cos\omega t)\,df/\omega^2. \tag{2.6 b}$$

From (1.6 a) and (2.5) the corresponding mean power spectrum for the
modulated carrier is

$$W(f) = A_0^2 \int_0^\infty dt\,[\cos(\omega_0+\omega)t + \cos(\omega_0-\omega)t]\,\exp(-D_0^2\,\Omega(t)), \quad (\omega = 2\pi f), \tag{2.7}$$

showing that the original carrier is either wholly or in part "smeared
out" into a continuum, which is symmetrical about the central frequency
$f_0$.

The mean total power $W_\tau$ in the wave is $A_0^2/2$ and the mean power
($W_{\rm per}$) remaining in the discrete components after modulation is given by
the oscillating portions of the correlation function in the limit $t\to\infty$ (Wang
and Uhlenbeck 1945, Middleton 1948 a, 1949), and of this the coefficient
of $\cos\omega_0 t$ yields the power $W_{f_0}$ remaining in the carrier. Consequently,
the mean power $W_c$ in the continuous part of the spectrum is
$W_c = W_\tau - W_{\rm per}$. Since we are assuming here modulation by a pure noise,
then $\lim_{t\to\infty}\Omega(t)$ does not oscillate indefinitely but approaches a steady value,
which may be zero, finite (>0), or even infinite, and the only discrete
component is the carrier itself. We may write, therefore,

$$W_{\rm per} = W_{f_0} = \frac{A_0^2}{2}\exp[-D_0^2\,\Omega(\infty)], \quad\text{and}\quad W_c = W_\tau - W_{f_0} = \frac{A_0^2}{2}\,(1 - \exp[-D_0^2\,\Omega(\infty)]) \tag{2.8}$$

for the mean power in the continuum.* For phase-modulation by a
purely random noise $V_N(t)$, $\Omega_{PM}(\infty)$ is a finite constant. The case of
frequency-modulation is less simple. We investigate the behaviour of
$\Omega_{FM}(\infty)$ by expanding $W_{FM}(\omega)$ in a power series about $\omega = 0$, viz.
$W_{FM}(\omega) = C_0 + C_1\omega + C_2\omega^2 + \ldots$, and substituting the resulting series
into (2.6 b) to obtain finally, on integration by parts,

$$\Omega_{FM}(\infty) = O\Big(\frac{C_0}{\omega} - C_1\log\omega - C_2 + C_3\,\omega + \ldots\Big)_{\omega\to 0}. \tag{2.9}$$

We distinguish three cases :

I. $C_0 \neq 0$ : $\Omega_{FM}(\infty) \to +\infty$, $O(1/\omega)_{\omega\to 0}$, or $O(t)_{t\to\infty}$ ;
II. $C_0 = 0$, $C_1 \neq 0$ : $\Omega_{FM}(\infty) \to +\infty$, $O(-\log\omega)_{\omega\to 0}$, or $O(\log t)_{t\to\infty}$ ;
III. $C_0 = C_1 = 0$, $C_2 \neq 0$ : $\Omega_{FM}(\infty) \to$ a constant (>0). (2.10)


Thus, if $W_{FM}(0)$ be finite, i.e. $C_0 > 0$, or if $W_{FM}(f)$ vanishes no more
rapidly than $O(\omega)$ (i.e. $C_0 = 0$, $C_1 > 0$), $\Omega_{FM}(\infty)$ is positively infinite and,
therefore, by equation (2.8) all the unmodulated carrier power is distributed
among the components of the continuum, and no energy remains in the
carrier. On the other hand, if the modulating noise contains no harmonics
at or near f=0, so that $W_{FM}(f)$ vanishes sufficiently rapidly as $f \to 0$,
i.e. $C_0 = C_1 = 0$, $C_2 \neq 0$ at least, then $\Omega_{FM}(\infty)$ may be a finite constant,
and we have effectively an example of the phase-modulation type
discussed above, where some power remains in the original carrier.
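The contrast between cases I and III can be seen by evaluating (2.6 b) numerically for increasing t. The sketch below uses two illustrative modulating spectra, neither taken from the paper: exp(−ω²), which is finite at zero frequency (case I), and ω² exp(−ω²), which vanishes there (case III). The integration variable is ω, so that df = dω/2π; the cut-off `w_max` and step count are arbitrary.

```python
import math

def omega_fm(spectrum, t, w_max=8.0, steps=16000):
    """Midpoint-rule estimate of integral_0^inf W (1 - cos wt) / w^2 df, df = dw / 2 pi."""
    h = w_max / steps
    total = 0.0
    for k in range(steps):
        w = (k + 0.5) * h  # midpoints avoid the removable singularity at w = 0
        total += spectrum(w) * (1.0 - math.cos(w * t)) / (w * w)
    return total * h / (2.0 * math.pi)

def finite_at_zero(w):
    """Case I example: spectral density finite at zero frequency."""
    return math.exp(-w * w)

def band_like(w):
    """Case III example: spectral density vanishing as w^2 at zero frequency."""
    return w * w * math.exp(-w * w)

def carrier_fraction(d0_squared, omega_value):
    """Fraction of the total power left in the carrier, exp(-D0^2 * Omega), from (2.8)."""
    return math.exp(-d0_squared * omega_value)

case1_short = omega_fm(finite_at_zero, 10.0)
case1_long = omega_fm(finite_at_zero, 100.0)
case3_short = omega_fm(band_like, 10.0)
case3_long = omega_fm(band_like, 100.0)
```

In the first case the integral keeps growing roughly in proportion to t, so the carrier fraction tends to zero; in the second it settles to a constant and a finite fraction of the power stays in the carrier.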

The different spectral distributions obtained in phase- and frequency- 
modulation are explained by the fundamental observation that the mean 
power associated with any frequency interval is directly proportional to 
the time the wave spends on the average in that interval. Therefore, in 
the instance of frequency-modulation, when the carrier is deviated at a 
very slow rate (with a fixed maximum or r.m.s. swing) corresponding on 
the average to frequencies at and very close to $\omega = 0$ [i.e. $W_{FM}(0) > 0$],
it spends an infinitely longer time at the limits of each swing than near
the central, or unperturbed, frequency, namely, near $f_0 \pm D_0/2\pi$ rather
than about $f_0$. Since the power in the wave is finite, the ratio of the energy
distributed about $f_0 \pm D_0/2\pi$ to the energy in the spectral vicinity of the
carrier is infinite: there is consequently no contribution to the carrier 
from these modulating components. Similar remarks apply for the 
higher modulating frequencies, except that now the energy (or power) 
ratio is finite, since the slowest possible rate of excursion through the 


* However, when $\lim_{t\to\infty}\Omega(t)$ oscillates indefinitely, because of periodic
components in the modulating wave, we may obtain the discrete portions of the
spectrum (2.7) by expanding $\exp[-D_0^2\,\Omega(t)]$ and collecting the periodic
terms which appear in $\Omega(t)$, $\Omega(t)^2$, etc. Combining these with $\cos(\omega_0 \pm \omega)t$
according to (2.7) then gives an (infinite) set of discrete harmonics, which are
located in frequency at $f_0 \pm f_n$ (where $f_n$ represents a typical harmonic of the
periodic part of the disturbance) and whose intensities are the coefficients of
$\cos(\omega_0 \pm n\omega_a)t$.


allowed frequency range cannot be less than the lowest modulation 
component. Furthermore, it is always the lowest modulating frequency 
which effectively determines the ratio of power at the limits of the swing 
relative to the power distributed about the centre of the band. The contri- 
butions of the higher frequencies are of secondary importance, since the 
time spent in any spectral region in the course of the frequency modulation 
is essentially infinitely longer for the very slow and zero, or “ adiabatic ” 
shifts than for the displacements occurring at greater rates. Accordingly,
when the modulating noise contains (nearly) zero-frequency components
(i.e. $W_{FM}(0) > 0$), the mean power in the carrier and in the continuum
is completely specified by the behaviour of $W_{FM}(f)$ at and in the
immediate vicinity of f=0. Similarly, when the modulating noise
contains a band of frequencies, for which there are no zero- or very low-
frequency terms, a finite amount of power remains in the carrier.

The same argument applies for phase-modulation. We have only to 
observe that the phase modulation can be replaced by an equivalent 
frequency-modulation, cf. (2.6 a), where the spectra of the two disturbances
are related by $W_{FM}(f) = \omega^2 W_{PM}(f)$. Since $W_{PM}(0)$ may exceed zero, it
follows that the corresponding spectrum $W_{FM}(f)$ is $O(f^2)$ as $f \to 0$, at
least, and consequently $W_{FM}(f)$ represents a band of noise. This is
sufficient to give a finite ratio of mean side-band to mean carrier power,
which by (2.8) is explicitly $\exp[D_0^2\,\Omega_{PM}(\infty)] - 1$.

A number of general observations concerning the spectrum of the 
modulated wave can now be made. For all ratios ($\omega_d/\omega_b$) of the r.m.s.
(angular) frequency deviation to the r.m.s. (angular) rate of deviation
(or $\omega_b$), or for all magnitudes of the r.m.s. phase change $\theta_d$, it is possible
to develop a series expansion for the mean spectrum. This is obtained
from (2.7) once the explicit form of $\Omega(t)$ has been determined. However,
it is sometimes possible to develop in series directly, without first
determining $\Omega(t)$. In the case of frequency-modulation we find that
(2.7) becomes*

$$W(f)_{FM} = A_0^2 \exp[-D_0^2\,\Omega_{FM}(\infty)] \sum_{n=0}^{\infty} \frac{(-D_0^2)^n}{n!} \int_0^\infty [\Omega_{FM}(t) - \Omega_{FM}(\infty)]^n \cos(\omega_0 - \omega)t\,dt, \tag{2.11}$$

provided $\Omega_{FM}(\infty)$ exists, which by (2.10) will occur only if the modulating
noise is a band with a lower cut-off frequency which vanishes, as $f \to 0$,
sufficiently rapidly. On the other hand, for phase-modulation we find
at once from (2.6 a) that a general expansion in powers of $\theta_d$, the r.m.s.

* Only when $f \approx f_0$ is there any appreciable contribution to the integrals in
(2.11), since the modulation is a narrow band about $f = f_0$. Accordingly, the
terms in $\cos(\omega_0+\omega)t$ are quite ignorable, as they oscillate far too rapidly (when
$f \approx f_0$) to make a noticeable addition to the resulting spectrum, distributed
about $f = f_0$.


phase deviation, can be effected. Again from (2.7) the spectrum of the
modulated carrier becomes

$$W(f)_{PM} = A_0^2 \exp(-\theta_d^2) \sum_{n=0}^{\infty} \frac{\theta_d^{2n}}{n!} \int_0^\infty r_0(t)^n \cos(\omega_0 - \omega)t\,dt, \tag{2.12}$$

where $r_0(t)$ is the normalized auto-correlation function of the modulating
noise $V_N(t)$, namely $\overline{V_N(t_0)V_N(t_0+t)}/\overline{V_N(t)^2}$, which by equation (1.6 b) is,
in terms of the spectrum of the modulation $V_N(t)$,

$$r_0(t) = \int_0^\infty W_{PM}(f)\cos\omega t\,df \Big/ \int_0^\infty W_{PM}(f)\,df.$$
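The series (2.12) can be summed term by term when $r_0(t)$ is simple enough. The sketch below assumes, purely for illustration, a gaussian correlation function $r_0(t) = \exp(-\omega_b^2 t^2/4)$ (the normalized transform of a gaussian modulating spectrum), for which each integral is elementary: for n ≥ 1 it equals $(\sqrt{\pi}/\omega_b\sqrt{n})\exp[-(\omega_0-\omega)^2/n\omega_b^2]$. The n = 0 term is the undisturbed carrier line and is left out. As a check, the continuum power is integrated over both sides of the carrier (assuming $f_0$ far exceeds the bandwidth) and compared with $(A_0^2/2)(1 - e^{-\theta_d^2})$ from (2.8).

```python
import math

def pm_continuum(delta_omega, theta, omega_b, a0=1.0, terms=60):
    """Continuous part (n >= 1) of the phase-modulation series at offset delta_omega.

    Assumes r0(t) = exp(-omega_b^2 t^2 / 4); delta_omega is omega_0 - omega.
    """
    total = 0.0
    coeff = 1.0  # running value of theta^(2n) / n!
    for n in range(1, terms + 1):
        coeff *= theta * theta / n
        width = omega_b * math.sqrt(n)
        total += coeff * (math.sqrt(math.pi) / width) * math.exp(-(delta_omega / width) ** 2)
    return a0 * a0 * math.exp(-theta * theta) * total

def continuum_power(theta, omega_b, half_range=40.0, steps=4000):
    """Trapezoidal integral of the continuum over frequency, df = d(delta_omega) / 2 pi."""
    h = 2.0 * half_range / steps
    total = 0.5 * (pm_continuum(-half_range, theta, omega_b)
                   + pm_continuum(half_range, theta, omega_b))
    for k in range(1, steps):
        total += pm_continuum(-half_range + k * h, theta, omega_b)
    return total * h / (2.0 * math.pi)

power_series = continuum_power(theta=1.0, omega_b=1.0)
power_expected = 0.5 * (1.0 - math.exp(-1.0))
```

The number of terms needed grows with $\theta_d^2$, which is why the series becomes impractical for large phase deviations.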


When the r.m.s. deviation is large compared to the r.m.s. rate of
deviation, or equivalently, when the r.m.s. phase variation is great, the
series (2.11) and (2.12) above do not converge rapidly enough to be
useful, and so an (asymptotic) expansion in inverse powers of $\omega_d/\omega_b$ or
$\theta_d$ is required. Such an expansion is easily obtained, provided now that
$\Omega(t)$ possesses all derivatives at t=0, for then we can develop $\Omega(t)$ in a
Maclaurin series about t=0, retain terms in t and $t^2$ only in $\exp[-D_0^2\,\Omega(t)]$,
and expand the rest in a series. We obtain finally the mean intensity
spectrum of the modulated carrier
WiN= pane) perpen 8 
oD) 12 De4 ! Q@)2° 


: $8) QwH2 
% Ds | sro f'(B) + Taaeon $18) 


] QV) OQ” OL6) Qs 
oe Di | sas p'9(B) + Sta one fLo(B) + ae, 716) | 


1 Q(10) l Q62 29H Qe) 
TD; Ee roms SR) + som. ( ere nr) p(B) 


QE)2O(6) Qi 
+ staper oer F(P)+ Fapm $3946) bid a 


ld ak a Ta SP eB eae) 
where 
f(a) ae ee ix 
and in which specifically 
Qn =0 ; OR | Walt) df =bo, (~@») ; Do( 2°)! =Dov/(o) Sea 5 
B=Byy =(Wo—w)/was - - + (2.144) 


28, by; OBL=| Woulflo* df =bs, (~o) ; 


D,( 2) =DyV/b2 =0av/(b2/bo) soe) =Pou =(w)—o)/y Ga; v= a/(b3/59) ; 
| (2.146) 
bez [ orWifdf; . . -.~ . (214e) 


0 
2, DEN) —(—1)"-1be,_2, (N21); QGP =(—1)"-1b2,, (n>0). . (2.14 d) 




Note that as the mean intensity of the modulating noise ($\sim\omega_b$) becomes
very small, or as the r.m.s. deviation ($\sim\omega_d$, or $\theta_d$) becomes very great,
while the other parameters of the system remain constant, one always approaches
a gaussian modulation spectrum, quite independent of the precise power
distribution of the modulating wave (in all cases for which (0) exists). 
As can be seen from (2.1), (2.3)-(2.5), (2.7), and (2.13), the gaussian 
limiting form is a direct consequence of the normal statistics of the 
modulating wave and the fact that V(t), equation (2.1), is a true frequency-
or phase-modulated carrier, i.e. there is no amplitude distortion of the
carrier in the final output after modulation. For non-normal random
noise such as precipitation or impulsive static, $W_2$ will not be a normal
frequency-function and consequently the limiting spectrum will no longer 
be gaussian. 

We are ready now to summarize the results for specific examples of the 
gaussian modulation spectrum : 


(i.) Frequency-modulation. 
In this case the gaussian spectral distribution of $V_N(t)$ is given by


Woaulf)=Wo exp (—@7/a7), se 2 ete) 


and therefore, according to (2.6 6) and (2.7), we have finally the spectrum 


A2 2 
Wu = = | dx cos Bx 
w pe 
exp| — V(r) “ {76(a/2)— Ta ll—exp (—ati4y1} 
= eo aes 
a 


To evaluate the integral we approximate the exponent, $f_1(x)$, by a suitable
function $f_2(x)$ which fits the true function $f_1(x)$ closely; in this instance
we require that $f_2(x)$ have the same values as $f_1(x)$ at and near $x=0$ and
as $x\to\infty$. Since $\lim_{x\to\infty} f_1(x) = x - 2/\sqrt{\pi}$ we seek a form of $f_2(x)$ which
facilitates the integration of (2.16) and which consequently does not
contain an exponential of argument other than $x$. Our choice is
$f_2(x) = x - 2/\sqrt{\pi} + (ax+b)\exp(-cx)$. The above conditions on $f_2(x)$ give us
finally $a - bc = -1$; $b = 2/\sqrt{\pi}$; $bc^2/2 - ac = 1/(2\sqrt{\pi})$, which yield


hs _ 2, fai pan 0-672) _ b 
7 /(ar) on ee OHO79 fit gd a 


_Vim)+v(7—2) 1-481 
pote | gree yoo oo Sie efits MESS Sete aad) 
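The three matching conditions quoted above reduce to a quadratic in c, so the admissible constants can be checked numerically. The following is a minimal sketch, not the author's computation: the explicit form used for the true exponent `f1` is an assumption read from the damaged equation (2.16), and the function names are hypothetical.

```python
import math

SQRT_PI = math.sqrt(math.pi)

def f1(x):
    """True exponent; this explicit form is an ASSUMPTION read from the garbled (2.16)."""
    return x * math.erf(x / 2.0) - (2.0 / SQRT_PI) * (1.0 - math.exp(-x * x / 4.0))

def f2(x, a, b, c):
    """Fitted form x - 2/sqrt(pi) + (a x + b) exp(-c x) quoted in the text."""
    return x - 2.0 / SQRT_PI + (a * x + b) * math.exp(-c * x)

def fit_constants():
    """Return both (a, b, c) triples satisfying the three matching conditions."""
    b = 2.0 / SQRT_PI
    # Substituting a = b*c - 1 into b*c**2/2 - a*c = 1/(2*sqrt(pi)) gives
    # b*c**2 - 2*c + 1/sqrt(pi) = 0, a quadratic in c.
    root = math.sqrt(1.0 - b / SQRT_PI)
    solutions = []
    for sign in (1.0, -1.0):
        c = (1.0 + sign * root) / b
        solutions.append((b * c - 1.0, b, c))
    return solutions

if __name__ == "__main__":
    grid = [0.1 * k for k in range(1, 101)]
    for a, b, c in fit_constants():
        worst = max(abs(f2(x, a, b, c) - f1(x)) for x in grid)
        print(f"a={a:.4f}  b={b:.4f}  c={c:.4f}  max|f2-f1| on (0,10] = {worst:.4f}")
```

Both roots of the quadratic satisfy the conditions at x = 0 and as x grows large; comparing the printed deviations is one way to see which pair fits the intermediate range better.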


It is found that c,, and accordingly a,,., give the best fit, being at the worst 
about 7 per cent too great for a~1 and proportionately better for smaller 


of Energy in Randomly Modulated Waves 699 


and larger x. Substituting these results into (2.16) and expanding the 
exponential containing exp (—cz) gives finally the mean spectrum 


= AG 2); 9 2 “|r n = 
W(f)eu= os exp (2w7/w7) x sees 5 4, (Cs B =) 


n=0 n! 
x Re[i-*@)(B+)V/[4/(a) JaBlo?-boan|et, se eR) 
ene =(W9—@)/w pL /(7)wj/@5-+e ,n], 


where BO(q) =m !4™/(1—ia)™+1 ; Im(a)>—1. 
When w;/w; is larger than unity we may use the asymptotic develop- 
ment (2.13); in this instance we observe from (2.14) and (2.15) that 


oem —(—D™ *(2m—?2)! 
BM 4/(x1)22™(m—1) ! 


and so, after some reduction, (2.13) becomes explicitly 


Wife 2 { oye") + + 3 (2!) 400" 


[Wigco; ease 1 


3 =) [24G(9(B") + 5g'9(B" 1 (22)" rrosog@(g' 
ui ( +5S(BI+ seq) (S2)" Losogre' 


+ 504g (8) 435408 1-+ sorezp; (4) [20160420(8" 


+ 168486228’) +.2520609(B’) +175g19(B)]+ . . a 


wylog<l; p=. ee (219) 


Fig. 1 illustrates a typical frequency-modulation spectrum for a number 
of values of the ratio w,/w, for spectra normalized to unity (at B =0). 
We remark that the asymptotic expression (2.19) can be used with fair 
accuracy for values of w,/w,—1; the spectral intensities for B>2, 
however, become progressively too low as f increases. 
(ii.) Phase-modulation. 

If we substitute the expression (2.15) for the mean intensity 

distribution of the modulating noise into (2.6 a) we find at once that 
Qou(t) =67[1 —ro(t)], with ro(t) exp (—w3t?/4), 
and therefore the mean spectrum of the modulated carrier becomes 
Ag she “2 Lacy. A/a Og 
oe a 
Pe a) 


exp tte eahte ser ey C4) 


Wf Jom = 




Fig. 1.

Normalized spectral distributions of a carrier frequency-modulated
by normal random noise. (Figure not reproduced; each curve carries a
scale factor by which the ordinate must be divided.)


Fig. 2.

Normalized spectral distributions of a carrier phase-modulated
by normal random noise. (Figure not reproduced; each curve carries a
scale factor by which the ordinate must be divided to give the true value.)




As explained in (a) above, a finite amount of power, (A?/2) exp (—63), 
remains in the carrier. Again, when 6, is much larger than unity it is
convenient to use the asymptotic development (2.13), for which now 


(2; lS eae : 
Cont! = a/ (71) 22™+2 77 | oer 


One has finally 


2or Gee 5 
Wife AYO { ovary 2 gosyar) + 1 gp aago(p” +-396(8") 


re " ") re 
H ag 7 [8PP(B") + 8h0(B') +.B") 
315 67 8196 (10)( As D) (12)( oy (14) Q’" 
+ 3701 10! ¢@ [966% (B’") +2006 (8’’)+6067%(6’’) 
+15¢79(8")]-++ ae PET (2.21) 
04 
The carrier term A?/2 exp ( (f—fo) is concealed in (2.21); when 


ww, the asymptotic series anus in the proper way to give the correct 
carrier power, for note that from (2.21) i W(f)oudf = Az/2 ; only the 
0 


leading term makes a contribution. Compare with (2.20), which is 
valid for all @;. Fig. 2 illustrates a number of representative spectra 
and, like fig. 1, here shows spectra normalized to a maximum value 
of unity. 

As expected, large modulation indices yield correspondingly large 
spectral spreads. In the case of frequency-modulation (fig. 1) one gets
in the limit of very small deviations or very rapid swings (i.e. (w7/w,)? <1)
a very narrow spectrum, which becomes essentially a discrete component 
at the carrier frequency (f)) when w,/w,>0. This is explained by 
observing that, since the (r.m.s.) rate of deviation (~w,) is now so rapid 
relative to the (r.m.s.) interval (~w,) over which the frequency excursion 
takes place, a vanishingly small fraction of the wave’s energy in any 
spectral region (proportional to the time the wave spends in that region) 
is distributed outside the immediate neighbourhood of the carrier fy. 
The case of phase-modulation (fig. 2) exhibits a similar behaviour, but
with a number of important differences. There is now effectively a finite 
lower bound to the width of the equivalent frequency-modulating noise 
which has a spectrum W(f)~w? exp (—w?/w3), as compared with the 
usual frequency-modulation case above, where W/(f)~ exp (—w?/a;), 
as a consequence of the fact that by definition the wave-form of the
modulations for the same noise disturbances are related as integrand 
and integral (cf. equation (2.2)). This in turn sets a lower limit to the 
spectral width for phase-modulation by noise, which approaches in 
shape the spectral distribution of the modulating wave, here 
~ exp (—w?/w?). However, since this limiting spectral intensity is 




proportional to 6? (63<1), the spectrum vanishes when 6, goes to zero, 
and we are left with the original, undistorted carrier. Frequency- 
modulation yields a greater spectral spread than phase-modulation 
(for the same deviation and identical noise waves) whenever the (r.m.s.) 
modulation index (0, or w,/w,) is of the order unity or greater, as 
comparison of figs. 1 and 2 shows. The reverse is true for indices less 
than about unity. The phenomenon is explained by the fact that the 
modulating frequencies of maximum intensity in the FM case lie near 
(w=0), while for 4M they fall in the neighbourhood of (w~w,) (since 
the spectrum of the equivalent frequency-modulating wave is proportional 
to w exp (—w2/w?). Accordingly, the phase-modulation cannot sweep 
the carrier through a given deviation more slowly than a frequency of 
order w,, and this sets an upper limit on the amount of power available
in any spectral region. We therefore expect less spectral spread in 
phase-modulation (for sufficiently large indices) because of the effectively 
greater rate of sweep. For small indices the reverse is true. 


(b) Carrier Amplitude-modulated by Noise. 


As in the preceding examples of frequency-modulation (cf. § (a)), we 
represent the IF (or RF) wave by a complex disturbance 


gly) =V(t) =Ao(t) exp (iargt), (wp =2rf), . «(2.22 a) 
where A,(t) is a real quantity. The amplitude modulation is specifically 


\[ A_0(t) = \begin{cases} A_0[1 + kV_N(t)], & y = kV_N(t) > -1 \\ 0, & y = kV_N(t) < -1 \end{cases} \qquad (2.22\,b) \]


in which $V_N(t)$ is a normal random noise voltage (or current) and $k$ is a
modulation index, with dimensions (volts)$^{-1}$. When the instantaneous
amplitude $V_N(t)$ is less than $-1/k$, over-modulation occurs, and the signal
generator does not oscillate until $V_N(t)$ is once more greater than $-1/k$.
Since we assume a purely normal random noise, large, and even infinite, 
amplitudes are possible, and consequently we may expect over-modulation 
for a noticeable part of the time, unless the modulating noise is weak. 
The analysis of this and succeeding sections assumes the common type 
of modulation in which the instantaneous amplitude of an oscillator’s 
output is modified according to some signal or other low-frequency 
disturbance applied to a suitable control grid. Frequently, however, 
a modulated output is produced by applying the sum of the separately
generated oscillations and the modulation to the input of a (half-wave 
linear) rectifier. The tube acts now as a mixing device, which yields a 
suitably modulated output carrier wave only if the original carrier 
oscillations are very intense relative to the modulation. Otherwise one 
obtains serious distortion due to the significant additional harmonics 
generated in the non-linear mixing of the carrier and the background 
noise. Thus, if a mixer is used, equations (2.22 a) and (2.22 b) apply here




approximately, provided the modulation is weak, while (2.22 a) and (2.22 b) 
are valid models for all degrees of carrier and modulation strengths when 
the alternative system of a modulated oscillator is employed. 

We consider now the general case (2.22 b), which includes over-modulation.
To represent the discontinuities in $A_0(t)$, cf. (2.22 b), we
express $A_0(t)$ in terms of its Fourier transform


he 

A(t) = 5 =| z%dzexp {+i2[1-+kV,(t)]}, - . . (2.28) 
dis Ct 

where C is a contour extending along the real axis from — co to + co and 

is indented downward in an infinitesimal semicircle about the singularity 

at z=0. The correlation function (2.3) is now 


Ri) =PR rol ea Sa ze 


1% 2 exp {—i€[1+kV x (to +0] eg e224) 
stat av 

since no coherence between carrier and modulation is assumed. Here C* 
is the contour conjugate to C, extending from + co to — o0, and is indented 
upward in an infinitesimal semi-circle about the point €=0. The ensemble 
average in (2.24) may be effected in a straightforward way if we note 
that since $V_N(t_0)$ ($=V_1$) and $V_N(t_0+t)$ ($=V_2$) are normal random
variables, their joint distribution is given by a relation similar to (2.4 a). 
Observing that the statistical average yields the characteristic function 
(cf. equation (2.16) of Chandrasekhar (1943)) for the noise : 


F(z, £3 ty =exp [— 34° (0) + &)—28h(1)] 
we have finally 


2 lot) ( dl j2— kp2?/2) 
RW) = A2 Ai Re at (1a an zexp (iz—hp 


477? 2 


oe fig —ioyer/2—Eayloee] 


more kb 
COS w eo ZL EolOl yp n COS Wol, ee ee Aa) 
ee n. 


272 
_AG he 
where 


n= =| 20-2 exp (tz—h%pz2/2) dz =—i-"(kap)t M2292 
’ TT Cc 


ea: ~a)/"(=) 


ac Ce 7 al) 




from equation (A 3.17) of Chandrasekhar (1943); $_1F_1$ is a confluent
hypergeometric function. Specifically (cf. (A 3.9) of Chandrasekhar (1943)),
we have for the amplitude functions $h_{0,n}$:


ho, oH + O(1/V/(2k2p)]-+-V/(kp/2) exp (—1/2K%))}, «(2.27 4) 
ho, et ie 2hb)]}, os reeriacaheie (2.27 b) 

ho, » (122) PPG] /(RP)], (= B, 8, 4, 5...) 
(2.27 c) 


where $\Theta(x) = (2/\sqrt{\pi})\int_0^x \exp(-y^2)\,dy$; $\phi^{(n)}(x) = (d^n/dx^n)[\exp(-x^2/2)/\sqrt{2\pi}]$, $n \ge 0$.


(See also Appendix IT. of Chandrasekhar (1943).) By equation (1.6 a) the 
mean intensity spectrum is the Fourier transform of (2.25), which becomes 
explicitly, when the modulating noise has a gaussian spectrum, cf. (2.15), 


ae (TARE [ 53 (w—w)? 
W(f) = S288 9 3(f—fa)+ aint {1 exp ( ent) 
io) pp sis ae et (w y—w)? : 
pty a 2 

Fig. 3 shows typical ee Bpeois a a number of values of 4/(2k*:b) 
between 0 and oo. 

When essentially no over-modulation takes place the correlation 
function reduces to 


= 
[4/(2h%b)>0] : R(t)= cos wot. {1+kebry(t)}, (k2b<0'18 for less than 
1 per cent i oltre 2 ve MeO Ganon 


Additional correction terms may be found in straightforward but tedious 
fashion. We note in fig. 3 that the mean intensity spectrum is here more 
widely distributed about fy than for the case of ignorable over-modulation. 
The additional harmonics are (carrier x noise) noise products, stemming 
from the clipping process inherent in over-modulation. However, unlike 
the examples of frequency- or phase-modulation discussed earlier in 
section (a), there is a limit to the spread of the spectrum, determined by the 
fact that over-modulation can occur at the maximum but 50 per cent 
of the time. On the other hand, since the amount of clipping inherent in 
the weak modulation cases is ignorable, no significant spectral spread 
is obtained there, vide fig. 3. In any case the spectral spread is relatively 
small. 

Whereas the spectrum requires a series development, the total mean 
power W, and the total intensity W, of the continuous part of the 
disturbance are easily obtained in precise, closed form from equation (2.22 b) 
if we remember that kV y(t) =y is normally distributed with the first-order 
probability density W,(y) : 


1 
W,(y) TL AVDA ee (—y?/2kp), +a. fe) Sen gee (2.30) 




The power in the carrier after modulation is (cf. (1.1) and (1.2)), 


Az} 2 a 
W, =Z 1) G+nWiedy| =78 
ra | a 2 
A2 (14 Of1/4/(2k2 
=< Sesoayyaeny}. . ean 
and 
2 ro 
W.= (0) = (1+y)2W,(y) dy Se a ee, NERY) 


i 
= on 


) envi (2kY) +1} — legate |, 


Fig. 3.

Mean intensity spectrum of carrier amplitude-modulated by noise.
(Figure not reproduced; ordinates are normalized, and the legend
identifies the mean modulating noise power and the modulation index.)


and so, 


W,=W,—-W, = =|; + fon veew+1} 
be {ive every yao) |. (2.33) 
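The closed forms (2.31) to (2.33) follow from elementary Gaussian integrals over the unclipped range y > −1, and they can be cross-checked by direct quadrature. The sketch below is only an illustration under stated assumptions: `sigma` denotes the r.m.s. value of y = kV_N, the prefactor A0²/2 is assumed for both the total and the carrier power, and the closed forms coded here were derived independently rather than copied from the damaged equations.

```python
import math

def normal_pdf(u):
    return math.exp(-0.5 * u * u) / math.sqrt(2.0 * math.pi)

def normal_cdf(u):
    return 0.5 * (1.0 + math.erf(u / math.sqrt(2.0)))

def clipped_moments(sigma):
    """Closed forms for E[(1+y)] and E[(1+y)^2] taken over y > -1, y ~ N(0, sigma^2)."""
    a = 1.0 / sigma
    m1 = normal_cdf(a) + sigma * normal_pdf(a)
    m2 = (1.0 + sigma * sigma) * normal_cdf(a) + sigma * normal_pdf(a)
    return m1, m2

def clipped_moments_numeric(sigma, n=20000):
    """Midpoint-rule check of the same two integrals."""
    lo, hi = -1.0, max(1.0, 10.0 * sigma)
    h = (hi - lo) / n
    m1 = m2 = 0.0
    for i in range(n):
        y = lo + (i + 0.5) * h
        w = normal_pdf(y / sigma) / sigma * h
        m1 += (1.0 + y) * w
        m2 += (1.0 + y) ** 2 * w
    return m1, m2

def mean_powers(sigma, A0=1.0):
    """Total, carrier and continuum power (prefactor A0^2/2 is an assumption)."""
    m1, m2 = clipped_moments(sigma)
    total = 0.5 * A0 * A0 * m2
    carrier = 0.5 * A0 * A0 * m1 * m1
    return total, carrier, total - carrier

def overmodulation_fraction(sigma):
    """Fraction of the time for which y < -1."""
    return normal_cdf(-1.0 / sigma)

if __name__ == "__main__":
    for sigma in (0.1, 0.5, 1.0, 3.0):
        print(sigma, mean_powers(sigma), overmodulation_fraction(sigma))
```

The over-modulation fraction approaches but never exceeds one half as the noise grows, which is the bound on spectral spread referred to in the discussion of fig. 3.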


SER. 7, VOL. 42, NO. 330.—JULY 1951 oe 






Fig. 4 illustrates the behaviour of the mean total, carrier and continuum intensities for
different degrees of over-modulation. As the intensity of the modulating 
noise is increased, a correspondingly greater proportion of the modulated 
wave’s power is distributed in the (noise x noise) noise sidebands generated 
in the process of modulation and the clipping due to possible over- 
modulation, as shown in fig. 4. We remark that the amount of power in 
the carrier component and in the continuum is quite independent of the 


Fig. 4.

Mean power in carrier amplitude-modulated by noise.
(Figure not reproduced; the curves show the mean total power, the mean
carrier power and the mean continuum power, and the legend identifies
the mean input noise power and the modulation index.)


particular spectral distribution of the original (normal random) noise 
and depends only on the clipping level at which over-modulation occurs, 
since power in a given band of frequencies is proportional to the integrated 
intensity of the band. One can consider also more complicated 
modulations, such as square-law modulation, viz., $A_0(t) = [1 + kV_N(t)]^2$:
the treatment is identical with that of the linear case,
following an appropriate modification of the transform relation (2.23). 




REFERENCES. 


Bennett, W. R., 1944, J. Amer. Acoust. Soc., 15, 165.

Bunimovich, V., 1946, J. Physics, U.S.S.R., 10, 35.

Chandrasekhar, S., 1943, Rev. Mod. Phys., 15, 1.

Middleton, D., 1946, J. Appl. Phys., 17, 778; 1948 a, Quart. J. Appl. Math.,
5, 445; 1948 b, Proc. Inst. Radio Engrs., 36, 1467; 1949, Quart. J. Appl.
Math., 7, 129.

Van der Pol, B., 1946, J. Instn. Elect. Engrs., 93, III, 153.

Rice, S. O., 1944, Bell Syst. Tech. J., 23, 282; 1945, Ibid., 24, 46.

Van Vleck, J. H., and Middleton, D., 1946, J. Appl. Phys., 17, 940.

Wang, M. C., and Uhlenbeck, G. E., 1945, Rev. Mod. Phys., 17, 323.


For primarily mathematical references, see


Blanc-Lapierre, A., 1945, Dissertation, University of Paris.

Cramér, H., 1940, Ann. Math. Ser., 2, 41, 215; 1946, Mathematical Methods
of Statistics (Princeton University Press).

James, H. M., Nichols, N. B., and Phillips, R. S., 1946, Theory of
Servomechanisms (M.I.T.) (Radiation Laboratory Series), 25, (New York:
McGraw-Hill), Chap. 6.

Khintchine, A., 1934, Math. Annalen, 109, 604.

Madelung, E., 1936, Die Mathematischen Hilfsmittel des Physikers (Berlin:
J. Springer).

Wiener, N., 1926, J. Math. and Phys. (M.I.T.), 5, 99; 1930, Acta Math.,
55, 117.




[ 708 ] 


LXXII. Observations on the Multiple Scattering of Ionizing Particles in
Photographic Emulsions. Part I. The Value of the Scattering Constant.


By K. Gottstein*, M. G. K. Menon, J. H. Mulvey,
C. O'Ceallaigh† and O. Rochat,
H. H. Wills Physical Laboratory, University of Bristol‡.


[Received May 10, 1951.] 


SUMMARY. 


The theories of multiple scattering of Williams, Moliére, Snyder and 
Scott are briefly surveyed. The results of five independent determinations 
of the scattering constant for G.5 nuclear research emulsions, details of 
which are to be presented in the following papers, are compared and 
discussed. The constant has been found to vary with cell-size and β in a
manner which is in agreement with theory. The effect of rejecting large 
angles due to plural scattering is considered, and also those errors which 
arise from the presence of spurious scattering. 


§1. STATEMENT OF THE PROBLEM. 


THE method of estimating the energy of fast ionizing particles in photo- 
graphic emulsions by measuring the deviations in their tracks produced 
by multiple scattering was first suggested by Bose and Choudhuri (1941), 
and is now one of the standard techniques of applied nuclear physics. 
Its most fruitful application has been to the study of cosmic radiation. 
The analogous method has also been applied with success to the tracks 
of electrons in the Wilson cloud-chamber (Groetzinger et al. 1950). 

During the past two years several papers have appeared dealing both 
with the theory and with the experimental applications of the method 
(see References). In particular, following an important advance in the 
technique described by Fowler (1950), a series of publications has 
appeared from this laboratory. Certain of these papers describe how 
the scattering method has been used to establish the mass of those 
particles of great energy which form the charged component of the 
penetrating showers (Camerini et al. 1950). 

The scattering method has also been used in experiments which 
demonstrate the presence in these showers, of neutral mesons of short 
life (Carlson, Hooper, King 1950), and again, in a general study of the 
properties of the soft component of cosmic radiation (Hooper and King 


* On leave of absence from the Max Planck Institut für Physik, Göttingen.
† On leave of absence from University College, Cork.
‡ Communicated by Professor Powell.


On the Observations on the Multiple Scattering of Ionizing Particles 709 


1950, Hooper, King and Morrish 1951). The somewhat different 
technique developed by the Brussels group (1950) has also been used 
successfully to study a wide range of similar problems.

The mean deviation of a charged particle which passes through a given 
layer of matter will later be shown to be directly proportional to the 
charge, and inversely proportional to the product momentum × velocity.
The constant of proportionality depends chiefly on the composition of 
the scattering medium. Thus, to calculate the energy and momentum 
of a particle from the observed mean angle of scattering in a given 
emulsion, we must know the value of the “scattering constant”
appropriate to the experimental conditions.

Values of this constant which show fair agreement with each other may 
be calculated by using various theories. On the other hand, there appears 
to be definite disagreement between previous experimental determinations. 
Since the chief use of the method is to estimate the energies of ionizing 


TABLE I.

Particle | Energy in MeV. (corrected for loss) | β² | Origin
1. Positrons | 105 | 1 | Cornell Synchrotron
2. Positrons | 185 | 1 | Cornell Synchrotron
3. Protons | 336 | 0.46 | 184 in. Berkeley Cyclotron
4. Protons and mesons | 5-50 | 0.07 | Cosmic ray particles ending in emulsion
5. Protons | 9-35 | 0.02 | Protons ejected from cosmic ray stars


particles, it has been felt desirable to obtain an experimental value for 
the constant applicable to the conditions of the work in progress in this 
laboratory. At the same time, because of its intrinsic interest, it was 
hoped to compare the experimental value of the constant with that 
predicted by theory. 


§2. SCOPE OF PRESENT EXPERIMENTS.


The calibration of Ilford G.5 emulsions has been carried out in two 
independent ways. In one approach, the measurements of scattering 
were made on the tracks of artificially produced monoenergetic particles. 
In the other, particles were used which had their origin in cosmic-ray 
events. Those selected came to rest in the emulsion having travelled 
such distance that their mass and energy could be determined accurately. 

The above Table lists the particles on which scattering measurements 
have been carried out, and shows the corresponding values of their energy 
and of the parameter $\beta^2 = v^2/c^2$.


710 K. Gottstein et al. on the Observations on the Multiple — 


As will be seen later, the value of the scattering constant is dependent 
to some extent on the velocity of the particle. Since the measurements 
above include the complete range of velocities from the classical to the 
extreme relativistic, they may be used to test experimentally whether 
the velocity dependence predicted by theory is correct. 

This programme of work has been carried out as three separate 
investigations. In the present paper, however, it has been thought best 
to discuss the results taken as a whole. In this way we avoid repeating 
definitions, and the argument will be the clearer for the absence of 
distracting experimental details. In the four succeeding papers of this 
series the experimental procedure will be fully described. 


§3. RÉSUMÉ OF THEORIES OF MULTIPLE SCATTERING.

We deal briefly with the theoretical treatment of the problem of
multiple scattering in thin foils given by Williams, Molière and Snyder
and Scott, although we shall make most use of that of Molière since it
has the widest range of application and has been presented in a convenient
form.
Following is a list of the most important symbols used in the present 
and in the succeeding papers of this series. 


$\Phi$ = projected angle of scattering on a plane at right angles to the
line of observation.

$\alpha$ = deflections due to all scattering in thin sheets of matter as
found by the theory of Williams. $\bar{\alpha}$ = mean absolute values
of these angles.

$\langle\Phi\rangle$ = mean absolute value of the resultant of individual deflections.

$N$, $Z$, $t$ = number of scattering centres per c.c., atomic number and
thickness in cm. of the scattering medium.

$ze$, $p$, $v$, $M_0$ = charge, momentum, velocity and rest-mass of scattered
particle.

$m$ = electronic rest-mass.

$\gamma = Zz/137\beta$.

$\sigma = T/M_0c^2$ = kinetic/rest energy of scattered particle.

$\theta = 2e^2(NZ^2)^{1/2}zt^{1/2}/pv$, an angular unit common to all theories.

$s$ = cell-size in units of 100 microns.


(a) Theory of Williams. 

The earliest simple theory of multiple scattering in a form which could 
conveniently be applied to the direct measurement of momentum is that 
of E. J. Williams (1935, 1939, 1940). He assumed that a charged particle 
passing through a thin layer of matter suffers a deviation which is the 
resultant of a very large number of individual deflections, the probability 
of any one of which is given by the Rutherford scattering law. He 


Scattering of Ionizing Particles in Photographic Emulsions 711 


conjectured, moreover, that the angle ® between the direction of incidence 
and the projected direction of emergence is distributed about zero in 
accordance with the normal law. 

The mean square value of all such projected deflections was found 
using Rutherford’s expression for the probability of single scattering, 
and integrating between limits $\Phi_{\min}$ and $\Phi_{\max}$. This is taken to be the
variance of the assumed normal distribution of $\Phi$. The result may equally
well be expressed in terms of $\langle\Phi\rangle$, the mean absolute deviation of the
distribution, and this is given by (1).


2eA(NZ2)itlz /, Br\t 
(by = (In oe) eee a ce 


The angle $\Phi_{\max}$ is the greatest single-scattering angle which can
contribute to the observed deflection. Physically, it is determined by
the finite size of the nucleus. As a first approximation to $\Phi_{\max}$, Williams
takes the angle $\Phi_1$ which is such that in traversing a thickness $t$ of
absorber, the particle will suffer on the average just one deflection greater
than $\Phi_1$. It follows that $\Phi_1 = \theta\sqrt{\pi/2}$.

$\Phi_{\min}$ is an effective minimum angle. Its magnitude is determined by
the ‘screening’. In deriving the basic single-scattering formulae,
using the Fermi-Thomas statistical model of the atom to allow for the
effect of screening, Williams considers separately the cases $\gamma = Zz/137\beta \ll 1$
(Born approximation valid), and $\gamma \gg 1$ (classical case). For $\gamma \ll 1$ this
calculation yields

\[ \Phi_{\min} = \frac{1.75\,mc\,Z^{1/3}}{137\,p}. \qquad (2) \]


With these assumptions he finds a mean deviation 


“ Wn ZA9N the? : 
= aaa ee Set US A tamtoeaih (3 
a,=8 e/ [in ( ee ) | 5. (In M) (3) 


M is a measure of the average number of collisions suffered by the particle 
in traversing the foil. 

Further calculations designed to take into account the contribution of
deflections $> \Phi_1$ yield the following expression for the mean deviation
due to all $\Phi$
(12450-40800) cs wes +», (4) 
(b) Theory of Molière.

To avoid the dichotomy $\gamma \gg 1$, $\gamma \ll 1$, Molière (1947, 1948) first solves
the problem of obtaining an expression for $\Phi_{\min}$ by making an exact
quantum-mechanical calculation of the cross-section for single scattering.
Using the Fermi-Thomas model he obtains an expression (5) valid at
both limits $\gamma \gg 1$ and $\gamma \ll 1$, and which also holds with sufficient
approximation in the intermediate region.


VA 


Prin = 0-4865 x10 8p 


L134 316yhe 2... (5) 




The problem of the composition of individual acts of scattering to 
yield the observed resultant deflection is solved by the method of 
Wentzel (1922). This leads to the definition of the parameter B which 
is the greater solution of the equation 

\[ B - \ln B = \ln\Omega_b - 0.115, \qquad (6) \]
where $\Omega_b = \pi\theta^2/\Phi_{\min}^2$.

$\Omega_b$ is a measure of the average number of collisions suffered by the
particle in traversing a distance $t$ in the scattering medium. For the
approximations to be valid, $1/B^n$ must be negligible for $n>2$. This
will be the case for $B>4.5$, a condition satisfied by most materials for
$t>10^{-3}$ cm. The standardized distribution of the projected angles of
emergence will then be

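\[ P(\phi)\,d\phi = [(2/\pi^{1/2})\,e^{-\phi^2} + f^{(1)}(\phi)/B + f^{(2)}(\phi)/B^2]\,d\phi, \qquad (7) \]
where $f^{(1)}(\phi)$, $f^{(2)}(\phi)$ are functions tabulated by Molière. The expression
$P(\phi)\,d\phi = (2/\pi^{1/2})\,e^{-\phi^2}\,d\phi$ is called by Molière the normal approximation to the
scattering distribution, whence we may see that $\pi^{1/2}B^{1/2}\theta = \sigma_N\sqrt{2}$, where
$\sigma_N$ is the standard deviation of this approximation.

The mean deviation of $\Phi$ is

\[ \langle\Phi\rangle = \theta\,B^{1/2}[1 + 0.982/B - 0.117/B^2] = \theta L, \qquad (8) \]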
and may thus be expressed in a form similar to that of Williams (1), but 
may be expected to hold for all y. Its derivation, moreover, is less 
dependent on artificial assumptions. We may note that as the thickness t 
of the absorber increases, so also does $B$, and hence that the distribution
of $\Phi$ will tend slowly towards the normal as $t$ increases. Physically, this
is easily understood in terms of a decreasing probability that the observed 
deflection is the result of single or plural scattering. 
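The parameter B of equation (6) is defined only implicitly, so in practice it has to be found numerically before the factor L of equation (8) can be evaluated. A minimal sketch follows, assuming the constant 0.115 as printed in this copy of equation (6) (it is exposed as a parameter in case the printed digits are unreliable); the function names are hypothetical.

```python
import math

def moliere_B(omega_b, const=0.115):
    """Greater root of B - ln B = ln(omega_b) - const, found by bisection."""
    rhs = math.log(omega_b) - const
    if rhs <= 1.0:
        # B - ln B has its minimum value 1 at B = 1, so no root above 1 exists.
        raise ValueError("no root with B > 1 for this omega_b")
    lo, hi = 1.0, 2.0
    while hi - math.log(hi) < rhs:   # expand until the root is bracketed
        hi *= 2.0
    for _ in range(100):             # B - ln B is increasing for B > 1
        mid = 0.5 * (lo + hi)
        if mid - math.log(mid) < rhs:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def L_factor(B):
    """L = B^(1/2) [1 + 0.982/B - 0.117/B^2], as in equation (8)."""
    return math.sqrt(B) * (1.0 + 0.982 / B - 0.117 / (B * B))

if __name__ == "__main__":
    for omega_b in (310.0, 620.0, 1240.0):
        B = moliere_B(omega_b)
        print(f"omega_b={omega_b:7.0f}  B={B:.3f}  L={L_factor(B):.3f}")
```

Bisection is used rather than Newton's method only because it cannot leave the bracket containing the greater root; either would serve.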

Goldschmidt—Clermont (1950) has published an instructive family of 
curves giving L,, (above) as a function of o =T/M)c? for various cell-sizes 
in photographic emulsions. It will be seen that L,, increases slowly 
both with the thickness of the absorber traversed and with decreasing 
velocity of the scattered particle. 

(c) Theory of Snyder and Scott. 

The treatment given by these authors (1949) of the problem of the 
multiple scattering of charged particles is based on a solution of the 
fundamental diffusion equation due to Fermi (Rossi and Greisen 1941). 
The theory assumes the existence of conditions which justify the use of 
the Born approximation. Because of this, and the method of treatment 
of the screening, its validity is restricted to scattering of fast particles in 
thin foils. 

The results are expressed in terms of two dimensionless parameters
$\eta/\eta_0$ and $t/\lambda$. $\eta/\eta_0$ is the angle of deflection in terms of an angular unit $\eta_0$
determined by the screening radius and defined as follows:


AG 


Ulta Ayp p) . . e ce) ° . . e (9) 




where $a$ is the radius of the first Bohr orbit of the hydrogen atom. This
is analogous to the $\Phi_{\min}$ of Williams and Molière. $\lambda$ is a unit path-length
in the absorber. In terms of constants already defined, the expression
becomes

U0) Ate ae ae A oss w Sass GLO) 


$1/\lambda$ is thus closely related to the $\Omega_b/t$ of Molière, and is a measure of the
number of collisions which take place in unit path-length of the absorber.

Although tables of the distribution function are given, the values of 
mean deviation must be computed by quadrature methods. For this 
reason, and because the theory of Snyder and Scott is restricted to the 
region of validity of the Born approximation, the present experiments 
have been compared with Moliére’s theory only. 

The work of the other authors (Goldschmidt-Clermont 1950, Groet- 
zinger et al. 1950, and Corson, private communication) indicates that the 
agreement between the theories of Snyder-Scott and Molière is
satisfactory for fast particles ($\beta^2 \approx 1$).


§4. DEFINITION OF SCATTERING CONSTANT. 


We have seen that the above theoretical investigations agree that the
mean deviation of the distribution of the projected angle of scattering may
be expressed in the form $\theta L$, but they differ slightly as to the calculated
magnitude of $L$.

Since the principal use of the theory is the estimation of the momenta
of particles by measurement of $\langle\Phi\rangle$, we may conveniently recast the
formula as follows:

\[ \langle\Phi\rangle = \frac{K z t^{1/2}}{pv}, \qquad (11) \]

where $K = 2e^2(NZ^2)^{1/2}L$.

We call $K$ the scattering constant. Because of the nature of the
factor $L$, it will be seen to depend chiefly on the properties of the
scattering medium, and only slightly upon the values of $t$ and velocity.

In a mixed medium such as the photographic emulsion, the appropriate
value of $(NZ^2)^{1/2}$ is $[\sum_j N_jZ_j^2]^{1/2}$, where $N_j$ is the number of atoms per unit
volume, and $Z_j$ the atomic number of the $j$th atomic species. Similar,
but more complicated averaging is needed to compute the effective value
of $L$ (Molière 1948, equations (6.9') and (6.10')).


§5. EXPERIMENTAL METHODS. 


There are two well-established techniques of measuring the multiple 
scattering of tracks in cloud chambers and in photographic emulsions. 
In both, the tracks are divided into equal cells of length =s, and changes 
in direction which take place from cell to cell are then measured. 

(1) The angular method in which a direct measurement of the angles 
is made (Goldschmidt—Clermont et al. (1948), Davies, Lock and Muirhead 
(1949)). In principle their method determines the mean absolute value 




of the angle between successive tangents. In practice, however, angles 
are measured between “ best lines” fitted to the track. It has recently 
been much improved by the Brussels Group (Goldschmidt-Clermont 
(1950), Levi Setti (1950)). 

Lattimore (1948), on the other hand, in his measurements on the mass 
of the c-mesons, used the angular method to determine the angles between 
successive chords. 

(2) The coordinate method of Fowler (1950). 

In this very rapid and convenient method, the coordinates of the track 
at intervals of one cell-length are measured and are used to determine 
the mean absolute value of the angles between successive chords. 

The values of $\langle\Phi\rangle$ as determined for the same track by the methods
of tangents and chords are not equal. It may be shown, however, that
in a sufficiently extended series of measurements, the value of the ratio
$\langle\Phi\rangle_{\mathrm{tan}}$ to $\langle\Phi\rangle_{\mathrm{chord}}$ tends to $(3/2)^{1/2}$. Reducing the value of $\langle\Phi\rangle$ to
degrees per cell-length of 100 μ and using the results of Molière, we may
express the scattering constant applicable to the coordinate method as

\[ K_{\mathrm{chord}} = 8.203\,B^{1/2}(1 + 0.982/B - 0.117/B^2)\ \frac{\text{degrees} \times \text{MeV.}}{(100\,\mu)^{1/2}} \qquad (12) \]
For the limiting case 62> 1 we have 
degrees x MeV. 
Kg 04-45 ee ig 
(1 WY pe) 

In general we have, for singly-charged particles, 

8-203L,, gs? 
(s, B) (14) 


CD yr ceth a 2+ 
fig 
l-++o 
As so defined, the scattering constant depends slightly on s and o, but 
the value appropriate to the conditions may be found from the curves 
of Goldschmidt-Clermont (1950), already referred to.
We find that the variation of L with cell-size is very well represented
by the formula for the case $\beta^2 = 1$


Li) =1-45--a(log e)& a. a) 


where the factor a has been determined empirically to be 1-19. 


§ 6. EXPERIMENTAL RESULTS. 


The results of the present series of determinations of the scattering
constant are shown in fig. 1 and Table II. Experimental values obtained
are plotted as a function of the parameter $\Omega_b$ of Molière’s theory (6),
which defines the number of collisions per cell. The numerals shown in
fig. 1 are those identifying the experiments given in Table II. The
number of collisions per 100 μ, $\Omega_b/s$, is a function of $\beta$ only. It is given
in the graph of fig. 2 for particles of charge |e| in G.5 emulsion.


The upper curve in fig. 1 represents the theoretical variation of the
scattering constant as a function of $\Omega_b$, (12). The second line has been
calculated for the cases where all angles larger than four times the mean
are eliminated from the statistics. It shows a scattering constant about
10 per cent smaller than the upper curve (numerical integration of
$\int_0^{4\langle\Phi\rangle} \phi P(\phi)\,d\phi$, with $P(\phi)$ as given in (7)). We shall deal with this point
in greater detail in Part II of the present series.


TABLE II.

Experiment                        $\beta^2$    Cell-size   $\Omega_b/s$   $\Omega_b$   K without cut-off   K with cut-off
                                                                                        (experimental)      (experimental)
1. Positrons 105 MeV.             1            200 μ       310            620          26.7 ± 0.6          26.2 ± 0.6
2. Positrons 185 MeV.             1            400 μ       310            1240         24.9 ± 0.8          24.0 ± 0.8
3. Protons 336 MeV.               0.46         600 μ       515            3150         30.7 ± 1.0          29.2 ± 1.0
4. Protons and mesons 5-50 MeV.   0.07         80 μ        900            710
5. Protons 9-35 MeV.              0.02 (av.)   72 μ        1320           850


Fig. 1. Scattering constant plotted against $\Omega_b$.
Comparison of experimental results with Molière’s theory of multiple scattering.
Curve (a) and points marked ●, without cut-off; curve (b) and points
marked ○, with cut-off. The numbers refer to the experiments listed in
Table II.


The experimental values indicated by black dots have been obtained 
without elimination of large angles. They should be compared with the 
upper curve. Those shown by open circles correspond to results obtained 
after elimination of large angles, and are to be compared with the lower 
curve. The uncertainties indicated are standard deviations calculated 
from the number of angles observed. If the uncertainty in energy is
taken into account the limits of error will be somewhat greater.


Fig. 2. Calculated value of the parameter $\Omega_b/s$ for Ilford G.5 emulsion as a function
of $\beta^2$ for particles of unit charge. s = cell-size in units of 100 μ, $\beta = v/c$.


The experimental results show a satisfactory agreement with theory.
As will be observed, however, the difference between the experimental
results without and with “cut-off” appears to be less than that calculated
from theory and shown by the difference between curves a and b. The
statistical weight is such that the discrepancy in the individual cases
cannot be regarded as significant. However, the fact that in all cases
the difference observed is smaller than that calculated seems to indicate
the operation of some genuine effect. In Part II of this series this
discrepancy will be discussed at greater length. It is shown that because
there we are dealing with an emulsion only 100 μ thick, rejecting a large
signal is likely to have less effect than that calculated by quadrature of
Molière’s expression (7). It is probable that the energy of the electrons
is less accurately known than that of the 336 MeV. protons. In the
case of 196 MeV. electrons corrections amounting to 6 per cent must be
applied to the nominal energy to correct for energy loss by Bremsstrahlung
and by ionization. We must remark, however, that our statistics do not
include the tracks of electrons which were suspected of having suffered
large losses through Bremsstrahlung while passing through the emulsion.
The scattering constant has been determined by using cell-sizes so chosen
that the results are little affected by the presence of spurious scattering.

The results of experiments 4 and 5 (Table II), using tracks ending in
the emulsion, are in good agreement with each other. They correspond,
however, more closely to the theoretical value without cut-off, even though
angles $> 4\langle\Phi\rangle$ had been eliminated.


Influence of Spurious Scattering. 


The influence of spurious scattering on the results obtained for 336 MeV. 
protons, 105 and 185 MeV. electrons is illustrated by the graph of fig. 3, 
in which the mean angles of multiple scattering measured for these 
three groups of particles are plotted as a function of cell-size. In the 
case of electrons, two types of points are plotted. The points represented 
with black symbols (squares or circles) correspond to values obtained 
after cut-off. Those represented with open symbols correspond to values
which include large angles. The full line (a) and dotted line (b) refer to
the theoretical value of the mean angle calculated from Moliére’s theory. 
(a) shows the value without, and (b) with a cut-off at four times the mean. 

Three different microscopes have been used for these determinations.
To test the “noise” of these instruments, the same persons who measured
the tracks of particles of lower energy have also measured the scattering
of a fast primary. Using a cell-size of 4000 μ the mean angle of scattering
per 100 μ of this track was found to be of the order of 0.0007 degrees,
corresponding to a mean sagitta of 0.046 in 1000 μ. Evidently the
results obtained with cells smaller than 1000 μ were entirely due to
spurious scattering. Thus for each microscope we could determine the
level of the noise for different cell-sizes, and these levels are represented
in fig. 3 by the points located on the lines marked noise.

For the pair of microscopes used for the electrons, the results of the 
measurements made on the fast primary were sufficiently close to each 
other for us to draw a single noise line valid for both. The noise line of 
the third microscope is significantly lower. It may be observed that 
none of these lines have been found to be inclined at 45° to the x-axis,
as would be expected if the magnitude of the spurious second differences 
were independent of cell-size. Thus, the observations are consistent with 
the hypothesis of an increase of the spurious second difference with 
cell-size (Part II. to follow). 

Taking the theoretical value of the angle of scattering of our mono-energetic
particles and the observed value of the “noise” of the
microscope used, we have calculated curves representing the apparent
scattering as a function of cell-size. This calculation depends on the
assumption that the noise observed with the fast primary is independent
of the value of the true scattering. The curves are then given by
$(\alpha_s^2 + \alpha_g^2)^{1/2}$, where $\alpha_{s,g}$ = angle of spurious, genuine scattering. They are
represented as dotted curves on fig. 3.
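
The statistical combination just described amounts to adding the genuine and spurious angles in quadrature. A minimal sketch follows; the function names are hypothetical, and it rests on the same assumption as the text, namely that the noise is independent of the true scattering.

```python
import math

def apparent_angle(genuine, spurious):
    """Apparent mean angle: quadrature sum of genuine and spurious angles."""
    return math.hypot(genuine, spurious)

def genuine_angle(apparent, spurious):
    """Invert the quadrature sum to recover the genuine scattering angle."""
    if apparent < spurious:
        raise ValueError("apparent angle lies below the noise level")
    return math.sqrt(apparent ** 2 - spurious ** 2)
```

The second function shows why a small ratio of signal to noise is unfavourable: when the apparent angle barely exceeds the noise, the correction removes most of the measured value.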

It can be seen that satisfactory agreement is also found between 
calculated and observed values of the apparent scattering, an agreement 
which seems markedly better in the case of 105 MeV. electrons. This may 
be due to the technique adopted for the 105 MeV. electrons. The course 
of the track was plotted point by point on squared paper, a procedure 
ensuring the correction of accidental errors and the removal of obvious 
“noise spikes”.


Fig. 3. Mean angle of multiple scattering in degrees in G.5 emulsions,
plotted against cell-size in microns. Experimental data obtained from
microscopes M 4045, M 4074 and M 40135.

Graph showing as a function of cell-size:

(1) The noise level as determined independently for three microscopes
(lines marked “noise”).

(2) The mean angle of multiple scattering for three groups of mono-energetic
particles, calculated from Molière’s theory (lines marked
“Molière”). (a) full line, without cut-off; (b) dotted line, with
cut-off.

(3) The statistical resultant of (1) and (2 (b)) (curves marked “Theor.
+ obs. noise”).

The latter curves may be compared with the experimental data.


§7. CONCLUSIONS. 


Within the limits of the statistical errors, the results obtained are in
agreement with theory. The trend of the variation of the scattering
constant with cell-size and $\beta$ follows that predicted. In principle, this
variation should be taken into account when estimating energy from
measurements of multiple scattering. In practice, however, its effect
is lessened because of the experimental procedure generally adopted.
For a particle of given momentum, the cell-size is chosen so that the
ratio signal/noise has the minimum acceptable value (about 2). In this
way a compromise is struck between the conflicting requirements:

(1) small correction for noise, 

(2) a sufficient number of independent readings of angle of scattering, 

the track being of limited length. 

Thus, fast tracks will be measured in large cell-sizes, but the value of
the constant appropriate to them will differ little from that for slower
tracks, which will be measured in smaller cells. This is because an increase
with cell-size in the value of the constant will be offset by a decrease due
to high velocity. This compensation will not hold in the extreme
relativistic region where the value of $\beta^2 \to 1$.

Thus, in routine measurements on tracks of particles other than
electrons, it is reasonable to use a constant independent of cell-size and $\beta$.
The value of 26.0 MeV. degree/(100 μ)$^{1/2}$ may be used for this purpose.
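
As an illustration of how such a constant might be applied, the sketch below converts a measured mean angle into an estimate of the product of momentum and velocity. It assumes, from the units of the constant, that the mean angle in degrees for a cell of s units of 100 μ equals K multiplied by the square root of s and divided by that product in MeV; the function name `p_beta_mev` and its parameters are hypothetical.

```python
def p_beta_mev(mean_angle_deg, cell_microns, k=26.0):
    """Estimate p*beta in MeV for a singly-charged particle.

    mean_angle_deg : mean absolute angle between successive chords, degrees
    cell_microns   : cell-size in microns
    k              : scattering constant in MeV degree / (100 micron)^(1/2)
    """
    if mean_angle_deg <= 0:
        raise ValueError("mean angle must be positive")
    s = cell_microns / 100.0              # cell-size in units of 100 microns
    return k * s ** 0.5 / mean_angle_deg  # assumed relation: angle = K sqrt(s) / (p beta)
```

The mean angle supplied should already have been corrected for noise; with few angles available on a single track the statistical uncertainty of the result will usually outweigh any error in the constant.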

The use of this constant over the whole range of measurement of
practical importance cannot introduce a systematic error greater than
±8 per cent. In most cases, the estimate of the energy of a single track
is subject to a much greater uncertainty due to the small number of angles
available. The nomogram of fig. 4 has been calculated in order to give
within 1 per cent accuracy the quantity $p\beta$ for protons as a function of
the angle of scattering per 100 μ. For particles of other masses the
value $M\bar{\alpha}_{100\mu}$ (M = mass of the particle in protonic units) must be used;
it corresponds to the quantity $p\beta/M$ in MeV.


ACKNOWLEDGMENTS. 


We wish to express our gratitude to Professor C. F. Powell, F.R.S.,
for extending to us the facilities of his laboratory, and for criticism and
suggestions from which we have derived much benefit. Our best thanks
are due to Professor Dale R. Corson who has very kindly put at our
disposal plates exposed to a beam of electrons selected from the Cornell
synchrotron, and to Dr. W. H. Barkas for his kind cooperation in
arranging the exposure of plates to 336 MeV. protons in the Berkeley
cyclotron.

It is a pleasure to thank Dr. H. Heitler and Miss J. Burnell for drawing
the diagrams and Mr. A. R. Gattiker for the photographic reproduction.

We are much indebted to Dr. G. Molière of the Max Planck Gesellschaft,
Hechingen, for communicating to us in manuscript form the results of
work not yet published. Certain of us wish to render thanks to the
following bodies for maintenance grants: D.S.I.R. (J. H. M.), University
College, Cork (C. O’C.), British Council (O.R.). Two of us have been granted
leave of absence. One of us (K. G.) wishes to thank Professor W. Heisenberg,
and the other (C. O’C.), the Governing Body of University College,
Cork.


Fig. 4. Nomogram.


REFERENCES. 


Bose and Choudhuri, 1941, Nature, Lond., 147, 240.

Camerini, Fowler, Lock and Muirhead, 1950, Phil. Mag., 41, 413.

Carlson, Hooper and King, 1950, Phil. Mag., 41, 701.

Davies, Lock and Muirhead, 1949, Phil. Mag., 40, 1250.

Fowler, 1950, Phil. Mag., 41, 169.

Goldschmidt-Clermont, King, Muirhead and Ritson, 1948, Proc. Phys.
Soc., 61, 183.

Goldschmidt-Clermont, 1950, Nuovo Cimento, 7, 331.

Groetzinger, Berger and Ribe, 1950, Phys. Rev., 77, 584.

Hooper and King, 1950, Phil. Mag., 41, 1194.

Hooper, King and Morrish, 1951, Phil. Mag., 42, 304.

Lattimore, 1948, Nature, Lond., 161, 518.

Levi Setti, 1950, Note No. 22, Bull. du Cen. de Phys. Nuc. de l’Univ. Libre de
Bruxelles.

Molière, 1947, Z. für Naturforschung, 2a, 133; 1948, Ibid., 3a, 78; 1951,
private communication.

Rossi and Greisen, 1941, Rev. Mod. Phys., 13, 240.

Snyder and Scott, 1949, Phys. Rev., 76, 220.

Wentzel, 1922, Ann. Physik., 69, 335.

Williams, 1935, Phys. Rev., 47, 568; 1939, Proc. Roy. Soc. A, 169, 531;
1940, Phys. Rev., 58, 292.


SER. 7, VOL. 42, NO. 330.—JULY 1951 3D 




LXXIII. On Combined Bending and Twisting of Thin Tubes in the Plastic 
Range. 


By R. Hill * and M. P. L. Siebel †,
University of Bristol ‡.


[Received March 7, 1951.] 


SUMMARY. 


Equations for the determination of the stresses and warping in a partly 
plastic thin tube of arbitrary section are obtained for any monotonic 
loading by bending and twisting couples. The analysis is based on the 
von Mises yield criterion and the Reuss stress-strain relations. The 
equations are solved numerically for a circular tube for various loading 
programmes. 


§ 1. FORMULATION OF THE PROBLEM.


THERE appears to be no published calculation of the stresses and distortion
in a tube which is partly overstrained by the simultaneous action of bending
and twisting couples §. This problem is relevant to the design of tubular
structures where permanent deformation might be advantageously
permitted in some members. Its solution would be a matter of some
difficulty when the tube is thick-walled, but is straightforward in principle
when the variations of stresses through the thickness can be disregarded.
This is assumed here; the error should be insignificant when the
thickness-diameter ratio is less than one tenth. Buckling appears improbable
when this ratio is greater than one fiftieth, so long as a substantial fraction
of the tube is not overstrained (Donnell 1934). A further simplification
in treating a thin-walled tube with this approximation is that we avoid,
without inconsistency, the need to take account of a triaxial state of
stress such as that in a bent beam when Poisson’s ratio differs from one
half (Hill 1950, p. 82).

We shall consider a monotonic process of distortion defined by an other- 
wise arbitrary loading programme. The aim of the analysis is to calculate 
the angles of bend and twist, the warping of the cross-section, and the 
distribution of stress at any moment. 


* Wills Physical Laboratory, Royal Fort. 

† Mechanical Engineering Laboratory.

‡ Communicated by the Author.

§ In unpublished work (1950) R. M. Haythornthwaite (Department of Civil
Engineering, Sheffield University) has examined the problem by an approximate
method which differs in principle from the analysis of the present paper
(private communication).


§ 2. COMPLETELY ELASTIC TUBE.


We confine our attention to cylindrical tubes of uniform thickness t.
The cross section is assumed to have two perpendicular axes of symmetry,
about one of which the bending moment is applied. Cartesian axes of
reference are taken in a typical section (fig. 1) such that Ox and Oy are the
axes of symmetry, while Oz is directed along the tube. Consider the
action which the part of the tube z > 0 exerts on the remainder, and let T
be the twisting couple about the positive z axis, and B the bending
couple about the positive x axis. Lateral forces being absent, the state of
stress and strain does not vary along the tube. For equilibrium the shear
stress $\tau$ acting over the section in the anticlockwise circumferential
direction must be uniformly distributed at each moment. The longitudinal
stress $\sigma$, on the other hand, varies round the circumference.

The longitudinal strain at any point is

$$\epsilon = y\psi, \qquad (1)$$

where $\psi$ is the angle of bend per unit length: that is, the curvature of the
tube axis. If $\theta$ is the twist per unit length and w the warp in the z direction
at any point, the engineering shear strain is

$$\gamma = \frac{\partial}{\partial s}\left(w + 2\theta A_s\right), \qquad (2)$$

where $A_s$ is the sectorial area measured anticlockwise from the x axis, and
s is the distance measured round the perimeter in the same sense.

Fig. 1.

While the tube is completely elastic the stresses are given by the
familiar formulæ

$$\sigma = E\epsilon = Ey\psi, \qquad \tau = G\gamma = 2G\theta A/p, \qquad (3)$$

where p is the total perimeter of the section, A its total area, E Young’s
modulus, and G the shear modulus. The warping is

$$w = 2\theta\left(\frac{sA}{p} - A_s\right), \qquad (4)$$

and vanishes identically only if $A_s \propto s$, i.e. if the section is such that the
perpendicular from the centre to any tangent is of constant length.
This is true, in particular, when the section is circular or rhombic. The
bending and twisting couples are

$$B = EIt\psi, \qquad T = 2At\tau, \qquad (5)$$

where $I = \oint y^2\,ds$ is the second moment of the contour about the axis of
bending. The relation between the ratio of the angles of bend and twist
to the ratio of the couples is therefore

$$\frac{\psi}{\theta} = \frac{2A^2 B}{pI(1+\nu)T}, \qquad (6)$$

where $\nu$ is Poisson’s ratio.
We shall suppose that work-hardening is negligible and take the yield
criterion to be that of von Mises:

$$\tfrac{1}{3}\sigma^2 + \tau^2 = k^2, \qquad (7)$$
where k is the shear yield stress. It is evident from (3) that plastic yielding 
first occurs where y is greatest: that is, at the extremities of the y axis 
(supposing the section to be convex outwards at all points). Let the 
y diameter of the tube be 2a. Then, if a suffix zero denotes the value of 
any quantity when yielding first occurs, 


Gap, (4A? a \ags 7% (4A? -4 
Dae) een 
where 
p= s(1+)2. 
If the ratio of the couples is given, their separate values at the yield-point
are found from

$$\frac{1}{3}\left(\frac{B_0 a}{It}\right)^2 + \left(\frac{T_0}{2At}\right)^2 = k^2. \qquad (9)$$
In terms of the strain-ratio r the yield-point couples are 
Bo Ir (4A? at 4p! 2A /4A2 ay 
oe a bee 2 Rien inet ef Pee sak 
oaA TY) KG (= pe ) “Sak ap (Fr +u**) : 


(10) 
§ 3. PARTLY PLASTIC TUBE.


During continued deformation two plastic zones spread towards the
axis of bending. Since the section has two axes of symmetry, and since
from (7) $\sigma$ has equal and opposite values in the two plastic zones, the latter
must cover a region $|y| \ge c$, say, at any moment. The plastic boundaries
$|y| = c$ are such that elements just on the elastic side are on the point of
yielding; $\sigma$ must therefore be continuous across the boundary. In the
elastic zone $\sigma$ is still equal to $Ey\psi$ and hence, by (7),

$$\frac{Gc\psi}{k} = \frac{\sqrt{3}}{2(1+\nu)}\left(1 - \frac{\tau^2}{k^2}\right)^{1/2}. \qquad (11)$$

In the plastic zones $\sigma$ has the uniform values

$$\sigma = \pm\sqrt{3(k^2 - \tau^2)} = \pm 2(1+\nu)Gc\psi. \qquad (12)$$

The couples are now given by

$$\frac{B}{2(1+\nu)Gt\psi} = c\int_P |y|\,ds + \int_E y^2\,ds, \qquad \frac{T}{2At} = \tau, \qquad (13)$$

where the suffixes P and E denote integration over the plastic and elastic
zones respectively.

If B and T are given, the equations (11), (12) and (13) suffice to determine
$\tau$, $\sigma$, c and $\psi$ uniquely (the loading having been monotonic). The problem
is thus statically determined. The angle of twist and the warping,
however, can only be found by following the entire loading history, the
relations between stress and plastic strain being incremental. When, on
the other hand, the strain-path is prescribed (and not the loads) the problem
is not statically determined and the stresses themselves can only be
found by following the strain-history.

The Reuss stress-strain relations (see, for instance, Hill 1950, p. 39)
imply that the shear stress does not affect the through-thickness and
peripheral strains, which are therefore equal. Denoting them by $\epsilon'$, we
have

$$\epsilon + 2\epsilon' = (1 - 2\nu)\sigma/E,$$

since the volume change is entirely elastic. Hence, from (1), (3) and
(12),

$$\epsilon' = \begin{cases} (\tfrac{1}{2} - \nu)c\psi - \tfrac{1}{2}y\psi, & c \le y \le a; \\ -\nu y\psi, & -c \le y \le c; \\ -(\tfrac{1}{2} - \nu)c\psi - \tfrac{1}{2}y\psi, & -a \le y \le -c. \end{cases} \qquad (14)$$

When $\nu \ne \tfrac{1}{2}$, the rate of variation in thickness as we travel round the tube
changes discontinuously at the plastic boundaries. We observe also that
$\epsilon'$ has equal and opposite values at the ends of a diameter.

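The Reuss equations give, further, a relation between the incremental
changes of stress and strain in each element:

$$\frac{d\gamma - d\tau/G}{d\epsilon - d\sigma/E} = \frac{3\tau}{\sigma}, \qquad |y| \ge c. \qquad (15)$$

With the help of (1), (2), (11) and (12), this may be reduced to

$$\frac{\partial}{\partial s}\left(\frac{\partial w}{\partial\psi}\right) + 2\frac{dA_s}{ds}\frac{d\theta}{d\psi} = \left\{1 + \frac{3\tau^2}{2(1+\nu)(k^2-\tau^2)}\right\}\frac{1}{G}\frac{d\tau}{d\psi} + \frac{3\tau|y|}{2(1+\nu)Gc\psi}, \qquad |y| \ge c.$$

In the elastic zone, $d\tau = G\,d\gamma$ or

$$\frac{\partial}{\partial s}\left(\frac{\partial w}{\partial\psi}\right) + 2\frac{dA_s}{ds}\frac{d\theta}{d\psi} = \frac{1}{G}\frac{d\tau}{d\psi}, \qquad |y| \le c.$$

The condition that $\partial w/\partial\psi$ should be a continuous function of position
round the periphery is

$$\oint \frac{\partial}{\partial s}\left(\frac{\partial w}{\partial\psi}\right) ds = 0.$$

Since this holds at each moment, w itself is then automatically continuous
(being continuous when the tube was completely elastic). The last three
equations lead to

$$\left\{p\left(1 - \frac{\tau^2}{k^2}\right) + \frac{3\tau^2}{2(1+\nu)k^2}\int_P ds\right\}\frac{1}{G}\frac{d\tau}{d\psi} = \left(1 - \frac{\tau^2}{k^2}\right)\left[2A\frac{d\theta}{d\psi} - \frac{3\tau}{2(1+\nu)Gc\psi}\int_P |y|\,ds\right]. \qquad (16)$$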

If the loading path is given, $\tau$, c, and the integrals can be calculated as
functions of $\psi$ as already explained. (16) then gives the quantity $d\theta/d\psi$
explicitly as a function of $\psi$, and hence $\theta$ after integration, using the
yield-point values from (6) and (8). In particular, if the tube is rendered partly
plastic by bending alone and then twisted as well, the initial torsional
rigidity is equal to its value in a purely elastic tube; for, when $\tau = 0$ in (16),
$d\tau/d\theta = 2GA/p$ as in (3). If, on the other hand, the strain-path is given,
$\theta$ is a prescribed function of $\psi$. (16) is then a differential equation which,
with (11), must be integrated numerically to obtain the unknowns $\tau$ and c.
The corresponding values of the couples are finally found from (13).

It is essential to find the limitation on the loading-path such that no
plastic element ceases to deform plastically. The previous analysis is valid
so long as the rate of performance of plastic work is not negative in any
element. In view of (15) the condition for this is

$$\sigma(d\epsilon - d\sigma/E) \ge 0,$$

or, from (1) and (12),

$$(|y| - c)\,d\psi - \psi\,dc \ge 0 \quad \text{when } |y| \ge c.$$

This is certainly satisfied when the bending is monotonic ($d\psi > 0$) and the
plastic zone increases steadily ($dc < 0$); the numerical solutions presented
later fulfil these conditions.


§ 4. LIMITING VALUES OF THE COUPLES.

If either B/T or $\psi/\theta$ is held constant, calculations show that B and T
approach certain limiting values (work-hardening being neglected).
The approach is very close even after a deformation which is only a few
times that at the yield point and so still small enough for the tube to be
treated as approximately cylindrical. These limiting values of the couples
have, therefore, a practical significance (especially in regard to buckling
and collapse), even though they are deduced on the assumption that the
tube can be treated as cylindrical after unlimited strain.

Since the stresses approach constant values the limiting conditions are
to be found by setting $d\tau/d\psi = 0$ in (16). This gives

$$\frac{\sqrt{3}\,\tau\oint |y|\,ds}{2A}\,\frac{d\psi}{d\theta} = (k^2 - \tau^2)^{1/2}, \qquad (17)$$

after using (12) to eliminate $c\psi$ (which remains finite). When B/T is held
constant, $\psi/\theta$ varies and $d\psi/d\theta$ in (17) is the limiting value of the derivative.
From (6), (11) and (13) the limiting values of the couples satisfy

$$\frac{B_\infty}{T_\infty} = \frac{3\left(\oint |y|\,ds\right)^2}{4A^2}\,\frac{d\psi}{d\theta}, \qquad \frac{1}{3}\left(\frac{B_\infty}{t\oint |y|\,ds}\right)^2 + \left(\frac{T_\infty}{2At}\right)^2 = k^2. \qquad (18)$$

$B_\infty$ and $T_\infty$ may be regarded as the couples that would be needed to initiate
deformation in a tube of plastic-rigid material (in which the rigidity
modulus has an infinitely great value). They may thus be alternatively
obtained by an application of the maximum work principle for a plastic-rigid
material (Hill 1948). The work done per unit length of tube in the
first increment of strain is $B_\infty\,d\psi + T_\infty\,d\theta$ (the work done in the warping
having equal and opposite amounts at the ends of the tube). For given $d\psi$
and $d\theta$, the work is a maximum for couples satisfying the second of (18):
that is, when

$$\Delta B_\infty\,d\psi + \Delta T_\infty\,d\theta = 0,$$

where

$$\frac{B_\infty\,\Delta B_\infty}{3\left(\oint |y|\,ds\right)^2} + \frac{T_\infty\,\Delta T_\infty}{4A^2} = 0.$$

On combining these we regain the first of (18).

Fig. 2. Yield-point ellipse and limiting ellipse, with B/2kAt and T/2kAt as coordinates.

It is instructive to compare the yield-point couples with their limiting
values. Equations (9) and the second of (18) define two ellipses if B/2kAt
and T/2kAt are taken as rectangular coordinates (fig. 2). When a pure
torque is applied, its value remains constant after the yield-point. When
a pure bending moment is applied, its limiting value exceeds the yield-point
value by the factor

$$a\oint |y|\,ds \Big/ \oint y^2\,ds.$$

This is greater than unity since $|y| \le a$. The yield-point ellipse therefore
lies within the limiting ellipse, but their distance apart is comparatively
small for normal sections. When the section is circular, for instance, the
above factor is $4/\pi \simeq 1.27$.

On comparing (6) and (18) we see that if $\psi/\theta$ is held constant the
limiting value of B/T is less than the yield-point value when

$$\tfrac{2}{3}(1+\nu)pI > \left(\oint |y|\,ds\right)^2.$$

For a circular section this is true when $1 + \nu > 12/\pi^2$ or $\nu > 0.216$. From
(10) and (18) it may be shown that $T_\infty > T_0$ if

$$\tfrac{2}{3}(1+\nu)pa > \oint |y|\,ds.$$

This is certainly so for $\nu = \tfrac{1}{2}$, whatever the section, and it is true for
a circular section for normal values of $\nu$. On the other hand, we shall
see later that, when the deformation is predominantly twisting, $B_\infty < B_0$.
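
The two figures quoted for the circular section can be checked numerically. The sketch below is illustrative only: it evaluates the contour integrals of |y| and y² round a circle by a midpoint rule and forms the pure-bending factor and the critical Poisson's ratio from the expressions above. The function names and the number of integration steps are hypothetical choices.

```python
import math

def circle_contour_integrals(a, n=20000):
    """Midpoint-rule values of the contour integrals of |y| and y^2
    round a circle of radius a."""
    ds = 2.0 * math.pi * a / n          # arc length of one segment
    int_abs_y = 0.0
    int_y_sq = 0.0
    for i in range(n):
        y = a * math.sin(2.0 * math.pi * (i + 0.5) / n)
        int_abs_y += abs(y) * ds
        int_y_sq += y * y * ds
    return int_abs_y, int_y_sq

def bending_gain_factor(a):
    """Ratio of limiting to yield-point moment in pure bending."""
    int_abs_y, int_y_sq = circle_contour_integrals(a)
    return a * int_abs_y / int_y_sq

def poisson_threshold_circle():
    """Poisson's ratio above which, at constant strain-ratio, the limiting
    B/T falls below the yield-point B/T for a circular section."""
    return 12.0 / math.pi ** 2 - 1.0
```

For a section of another shape only the parametrisation of the contour inside `circle_contour_integrals` would need to be changed.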


§ 5. WARPING. 


On integrating the previous equations for the warping round the tube
at a given moment, and choosing $\partial w/\partial\psi = 0$ on the axis of bending, we
find that

$$\frac{\partial w}{\partial\psi} = \frac{s}{G}\frac{d\tau}{d\psi} - 2A_s\frac{d\theta}{d\psi}, \qquad 0 \le y \le c;$$

$$\frac{\partial w}{\partial\psi} = \frac{s}{G}\frac{d\tau}{d\psi} - 2A_s\frac{d\theta}{d\psi} + (s - s_c)\frac{3\tau^2}{2(1+\nu)(k^2-\tau^2)}\frac{1}{G}\frac{d\tau}{d\psi} + \frac{3\tau}{2(1+\nu)Gc\psi}\int_{s_c}^{s} |y|\,ds, \qquad c \le y \le a; \qquad (19)$$

where $s_c$ is the value of s when y = c. The tube being symmetrical about
the (x, y) axes, $\partial w/\partial\psi$ vanishes also on the y axis, in view of (16). It is
found, further, that $\partial w/\partial\psi$ is antisymmetric with respect to the (x, y) axes;
that is, it takes the same value at opposite ends of a diameter while the
values on a pair of symmetrically disposed diameters are equal and
opposite.

The warping in any element is obtained by integrating the above
equations with respect to $\psi$, using the first while the element is elastic
and the second while it is plastic.


§ 6. NUMERICAL RESULTS.

When the section is circular the bending moment is given by

$$\frac{B}{ka^2t} = 2(1+\nu)\frac{Ga\psi}{k}\left(\pi + \sin 2\alpha - 2\alpha\right), \qquad (20)$$

where $c = a\cos\alpha$. The calculated variation of the bending moment with
the angle of bend is shown in fig. 3 for loading under the constant ratios
B/T = ½, 1, 2, and ∞ ($\nu$ = 0.3). The variation of both T and $\tau$ with the
angle of bend is, of course, exactly similar. The yield-point and limiting
ellipses are

$$\tfrac{4}{3}B_0^2 + T_0^2 = (2\pi ka^2t)^2 = \tfrac{\pi^2}{12}B_\infty^2 + T_\infty^2,$$

since $I = \pi a^3$, $\oint |y|\,ds = 4a^2$, for a circular section. Hence, when B/T is
constant,

$$\frac{B_\infty^2}{B_0^2} = \frac{4B^2/3T^2 + 1}{\pi^2B^2/12T^2 + 1}.$$

These limiting values are indicated by broken lines in fig. 3. It will be
observed that when B/T = ½ or 1, B is within 2 per cent of the limiting
value when the angle of bend is only twice its yield-point value; when
no torque is applied, B is within 5 per cent of its limiting value at this
stage. The extent of the plastic zone can be judged from fig. 4, which
shows the relation between the angle of bend and the fraction of the tube
which is plastic.

The relative rate $d\theta/d\psi$ of twisting to bending, calculated from (16),
is given in fig. 5, and the relation between the angles of bend and twist
in fig. 6. From (18) the limiting value of $d\theta/d\psi$ is equal to $12T/\pi^2B$, and
this is indicated by the broken lines in fig. 5. The relation between
torque and twist is shown in fig. 7.

The warping, calculated by integrating (19), is shown in fig. 8 for
B/T = 1 at an angle of bend such that $Ga\psi/k = 1.077$; $\alpha$ is then equal to
65°, so that just over two-thirds of the tube is plastic.

The solution for deformation under constant strain-ratios $\theta/\psi = 1$ and 2
has also been computed for Poisson’s ratio ½. The variation of the couples
with the angle of bend is shown in fig. 9. It will be noticed that the
approach to the limiting value is oscillatory. Despite the fact that one
or other of the couples (but not both together) temporarily decreases,
the plastic zone continues to expand steadily (fig. 10) and no plastic
element of the tube unloads.

Fig. 3. Variation of the bending moment, $B/4\sqrt{3}ka^2t$, with the angle of bend, $Ga\psi/k$; limiting values shown by broken lines.

Fig. 4. Relation between the angle of bend and the fraction of the tube which is plastic.
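
Because the problem is statically determined when the couples are prescribed, the extent of the plastic zone and the angle of bend of a circular tube follow directly from B and T. The sketch below illustrates this under stated assumptions: it uses the circular-section relations as reconstructed here, namely $\tau/k = T/2\pi ka^2t$, the continuity condition $2(1+\nu)Ga\psi\cos\alpha/k = \sqrt{3}(1-\tau^2/k^2)^{1/2}$ and the bending-moment formula (20), and solves for $\alpha$ by bisection. The function name and the dimensionless arguments are hypothetical, and this is not claimed to be the authors' computing procedure.

```python
import math

def circular_tube_state(b_hat, t_hat, nu=0.3):
    """Return (alpha, psi_hat) for a circular tube under given couples.

    b_hat, t_hat : B/(k a^2 t) and T/(k a^2 t)
    alpha        : half-angle of each plastic zone, c = a cos(alpha)
    psi_hat      : G a psi / k
    """
    tau = t_hat / (2.0 * math.pi)            # tau/k, from T = 2 A t tau with A = pi a^2
    if not 0.0 <= tau < 1.0:
        raise ValueError("torque outside the range 0 <= tau/k < 1")
    s = math.sqrt(3.0 * (1.0 - tau * tau))   # |sigma|/k in the plastic zones
    if b_hat <= math.pi * s:
        # Completely elastic: B = E I t psi with I = pi a^3.
        return 0.0, b_hat / (2.0 * (1.0 + nu) * math.pi)
    if b_hat >= 4.0 * s:
        raise ValueError("couples lie on or outside the limiting ellipse")

    def h(alpha):
        # (pi + sin 2a - 2a) / cos a rises monotonically from pi to 4.
        return (math.pi + math.sin(2.0 * alpha) - 2.0 * alpha) / math.cos(alpha)

    lo, hi = 0.0, math.pi / 2.0 - 1e-9
    for _ in range(200):                     # bisection on s * h(alpha) = b_hat
        mid = 0.5 * (lo + hi)
        if s * h(mid) < b_hat:
            lo = mid
        else:
            hi = mid
    alpha = 0.5 * (lo + hi)
    psi_hat = s / (2.0 * (1.0 + nu) * math.cos(alpha))
    return alpha, psi_hat
```

With B/T = 1, $\nu$ = 0.3 and couples chosen so that $\alpha$ = 65°, the routine gives a value of $Ga\psi/k$ close to the 1.077 quoted in the text. The twist and the warping are not obtained in this way, since they depend on the loading history.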


§ 7. SOLUTION BASED ON HENCKY RELATIONS.


In view of the labour involved in solving the equations based on the
Reuss incremental-strain relations, it is worth examining whether the
simpler solution based on the Hencky total-strain relation is in this
problem sufficiently accurate (though admittedly not acceptable physically;
see Hill 1950, pp. 45-8). When Poisson’s ratio is ½, the Hencky
relations lead to

$$\tau = \frac{2GA\theta}{p + \int_P \left(\frac{|y|}{c} - 1\right) ds}$$

in place of (16), and

$$w = \frac{s\tau}{G} - 2\theta A_s, \qquad 0 \le y \le c;$$

$$w = \frac{s_c\tau}{G} - 2\theta A_s + \frac{\tau}{Gc}\int_{s_c}^{s} |y|\,ds, \qquad c \le y \le a;$$

in place of (19). The equations (17) and (18) retain the same form, but
$\psi/\theta$ replaces $d\psi/d\theta$.

Fig. 6. Relation between the angles of bend and twist.

Fig. 7. Relation between torque and twist.

For a circular section,

$$\frac{\tau}{k} = \frac{\pi}{\pi + 2\tan\alpha - 2\alpha}\,\frac{Ga\theta}{k}, \qquad \sqrt{3}\,\frac{Ga\psi}{k}\cos\alpha = \left(1 - \frac{\tau^2}{k^2}\right)^{1/2}, \qquad (21)$$

where $c = a\cos\alpha$. The solution of (20) and (21) when $\theta/\psi = 1$ and 2 is
shown in fig. 9 for comparison with the solution for the Reuss theory.
The limiting values of the couples are the same and the approach is
again oscillatory. As was to be expected, the difference between the
values of the couples predicted by the two theories is comparatively
small when the ratio of the angles of bend and twist is held constant.
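
The Hencky solution for the circular tube requires only the solution of one algebraic equation at each stage, which the following sketch illustrates. It assumes Poisson's ratio ½, a constant strain-ratio r = $\theta/\psi$, and the circular-section relations as reconstructed here: $\tau/k = \pi(Ga\theta/k)/(\pi + 2\tan\alpha - 2\alpha)$, $\sqrt{3}(Ga\psi/k)\cos\alpha = (1 - \tau^2/k^2)^{1/2}$ and $B/ka^2t = 3(Ga\psi/k)(\pi + \sin 2\alpha - 2\alpha)$. The function name and the use of bisection are illustrative choices, not the authors' method of computation.

```python
import math

def hencky_circular(psi_hat, r):
    """Hencky solution for a circular tube with Poisson's ratio 1/2.

    psi_hat : G a psi / k (angle of bend, taken positive)
    r       : constant strain-ratio theta / psi
    Returns (alpha, tau/k, B/(k a^2 t), T/(k a^2 t)).
    """
    theta_hat = r * psi_hat
    if 3.0 * psi_hat ** 2 + theta_hat ** 2 <= 1.0:
        # Completely elastic: sigma = 3 G a psi at y = a, tau = G a theta.
        alpha, tau = 0.0, theta_hat
    else:
        def tau_of(alpha):
            return math.pi * theta_hat / (math.pi + 2.0 * math.tan(alpha) - 2.0 * alpha)

        def residual(alpha):
            t = tau_of(alpha)
            return math.sqrt(3.0) * psi_hat * math.cos(alpha) - math.sqrt(max(0.0, 1.0 - t * t))

        # The residual decreases steadily with alpha, so bisection finds the root.
        lo, hi = 0.0, math.pi / 2.0 - 1e-12
        for _ in range(200):
            mid = 0.5 * (lo + hi)
            if residual(mid) > 0.0:
                lo = mid
            else:
                hi = mid
        alpha = 0.5 * (lo + hi)
        tau = tau_of(alpha)
    b_hat = 3.0 * psi_hat * (math.pi + math.sin(2.0 * alpha) - 2.0 * alpha)
    t_hat = 2.0 * math.pi * tau              # T = 2 A t tau with A = pi a^2
    return alpha, tau, b_hat, t_hat
```

At large angles of bend the couples returned by this routine approach the limiting ellipse for the circular section, which provides a convenient check on the arithmetic.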


Fig. 9. T or B (unit = $ka^2t$) plotted against $Ga\psi/k$: Reuss, Hencky and limiting values.

Fig. 10.

REFERENCES. 
Donnell, L. H., 1934, Trans. Amer. Soc. Mech. Eng., 56.

Hill, R., 1948, Quart. Journ. Mech. Appl. Math., 1, 18; 1950, The Mathematical
Theory of Plasticity (Oxford: Clarendon Press).


LXXIV. Internal Barriers in Semi-Conductors. 


By H. K. Heniscu, Ph.D., A.Inst.P. 
Department of Physics, The University, Reading *. 


[Received March 14, 1951.] 


ABSTRACT. 


The temperature dependence of conductivity is calculated for a semi- 
conducting specimen containing internal barriers of various heights. This 
leads to a new interpretation of activation energies as deduced from 
conduction measurements. 


§1. INTRODUCTION. 


THE results obtained by various workers (Verwey 1950) (Volger 1950) 
(Miller 1950) (Chasmar 1948) (Henisch 1950) during recent years show that 
a good deal of importance must be attached to the presence of inter- 
granular barriers in many types of semi-conducting materials. Such 
barriers are expected to constitute a source of high resistance, and the 
effective constants of a bulk specimen may be controlled by them. In 
particular, the frequency dependence of the effective resistivity is usually 
ascribed to capacitive short-circuiting of inter-granular barriers. The 
unexpectedly large dielectric constants observed on some materials can 
also be interpreted on the basis of this model. The barriers in question 
may arise from the discontinuity of the lattice itself or, more probably, 
from adsorbed impurities. (Mott 1950.) 

The temperature dependence of conductivity in micro-crystalline 
specimens containing inter-granular barriers would be controlled by the 
barrier height. If all barriers are of equal height φ, the conductivity
would be approximately proportional to exp (−φ/kT). In general, however,
it is more probable that barriers of different height are represented,
e.g. due to a varying density of surface states at grain boundaries. It is 
shown in this paper how the temperature dependence of conductivity can 
be calculated for this more complicated system. The present treatment 
can also be generalized to apply to internal barriers which may arise in 
single crystals. The conclusions are of general importance in connection 
with the interpretation of activation energies derived from conductivity 
measurements over a range of temperatures. 


§ 2. TEMPERATURE DEPENDENCE OF EFFECTIVE BULK CONDUCTIVITY. 

The calculations in this section are restricted to direct or low frequency 
currents, and it is assumed that the various barrier heights are normally 
distributed about some mean value with a certain standard deviation. 


* Communicated by the Author. 




The voltage across any particular barrier is taken to be small compared 
with kT/e, so that the barrier resistance can be treated as independent of 
voltage. Deviations from Ohm’s law would normally be expected only 
if measurements are carried out with high-voltage pulses. 

Given a cross-section through the material perpendicular to the direction
of current flow, it will be clear that most of the current will cross this section
in places where the barrier height φ is small, since the current is very
sensitive to changes of φ. This applies to any section. The current will
thus flow along filaments of low φ value, and the experimental observations
will therefore relate mainly to the properties of these filaments. Let the
number of barriers per filament be N. Among these barriers, various
values of φ will be represented, and the number of barriers dN_φ whose
height is between φ and φ+dφ will be given by

$$dN_\phi = \frac{N}{s\sqrt{2\pi}} \exp\left[-\frac{1}{2}\left(\frac{\phi-\bar{\phi}}{s}\right)^2\right] d\phi, \tag{1}$$

if φ̄ ≫ s, where φ̄ and s are the mean φ and its standard deviation for the
particular filament. We shall assume, for the sake of simplicity, that all the
filaments are of equal cross-section and that they contain equal numbers of


barriers. The barriers connected in series in this way will carry a common
current Δj, so that

$$\alpha\,\Delta j = U_1 \exp(-\phi_1/kT) = U_2 \exp(-\phi_2/kT) = \dots = U_\phi \exp(-\phi/kT), \tag{2}$$

where α is some constant and U_φ is the barrier voltage. The voltage
across the dN_φ barriers of height φ will be U_φ dN_φ. Using the continuous
frequency distribution, the total voltage applied to the ends of the filament
becomes

$$U = \int U_\phi\, dN_\phi = \frac{N}{s\sqrt{2\pi}} \int_{-\infty}^{+\infty} U_\phi \exp\left[-\frac{1}{2}\left(\frac{\phi-\bar{\phi}}{s}\right)^2\right] d\phi. \tag{3}$$


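Substituting from (2) this gives

$$U = \frac{\alpha N\,\Delta j}{s\sqrt{2\pi}} \int_{-\infty}^{+\infty} \exp\left[-\frac{1}{2s^2}\left(\phi^2 - 2\phi\left(\bar{\phi} + \frac{s^2}{kT}\right) + \bar{\phi}^2\right)\right] d\phi. \tag{4}$$

It is now convenient to put y = φ − (φ̄ + s²/kT). Substitution and
rearrangement then leads to

$$U = \Delta j\,\frac{\alpha N}{s\sqrt{2\pi}} \exp\left[\left(\bar{\phi} + \frac{s^2}{2kT}\right)\Big/ kT\right] \int_{-\infty}^{+\infty} \exp(-y^2/2s^2)\, dy, \tag{5}$$

and hence

$$U = \Delta j\,\alpha N \exp\left[\left(\bar{\phi} + s^2/2kT\right)/kT\right]. \tag{6}$$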


This shows that the principal term which controls the temperature
dependence of the filamentary current is still approximately exponential,
provided s is not too large and the considerations are applied to a restricted
temperature range. The effective barrier height is then given by

$$\phi_{\text{eff}} = \bar{\phi} + s^2/2kT. \tag{7}$$
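The step from the Gaussian distribution of barrier heights to the effective barrier height can be checked numerically. The following is a minimal sketch, not part of the original treatment: it averages exp(φ/kT) over a normal distribution by the trapezoidal rule and compares the result with the closed form φ̄ + s²/2kT. The function names, the integration width and the value taken for Boltzmann's constant are assumptions made for illustration only.

```python
import math

K_BOLTZMANN_EV = 8.617e-5  # Boltzmann's constant in eV/K (assumed value)


def effective_barrier(phi_mean, s, kT):
    """Closed form for the effective barrier height: phi_mean + s^2 / (2 kT)."""
    return phi_mean + s * s / (2.0 * kT)


def filament_voltage_factor(phi_mean, s, kT, n_steps=20001, width=10.0):
    """Average exp(phi/kT) over a normal distribution of barrier heights.

    Returns U / (alpha * dj * N), evaluated by the trapezoidal rule over
    phi_mean +/- width * s (both the rule and the width are illustrative choices).
    """
    lo = phi_mean - width * s
    hi = phi_mean + width * s
    h = (hi - lo) / (n_steps - 1)
    norm = 1.0 / (s * math.sqrt(2.0 * math.pi))
    total = 0.0
    for i in range(n_steps):
        phi = lo + i * h
        weight = 0.5 if i in (0, n_steps - 1) else 1.0
        gauss = norm * math.exp(-0.5 * ((phi - phi_mean) / s) ** 2)
        total += weight * gauss * math.exp(phi / kT) * h
    return total


phi_mean, s = 0.1, 0.01            # eV, the values used in the worked example
kT = K_BOLTZMANN_EV * 300.0        # an arbitrary temperature inside the range
phi_eff_numeric = kT * math.log(filament_voltage_factor(phi_mean, s, kT))
phi_eff_closed = effective_barrier(phi_mean, s, kT)
print(phi_eff_numeric, phi_eff_closed)
```

The two printed values should agree closely, which is only a check on the algebra of completing the square and says nothing about the physical assumptions.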


The above considerations apply to a single filament. It is then
necessary to sum the contributions Δj towards the total current over all
the filaments which are now connected in parallel and which are characterized
by somewhat different values of φ̄ and s. If it is assumed that the
various values of φ̄ are themselves normally distributed, this second
summation leads to a new effective barrier height, characteristic for the
parallel assembly of all the filaments. Thus, the resistivity would still be an
approximately exponential function of temperature. It can be shown
that this new effective barrier height will be given by an expression of the
same form as (7), though the terms involved will have somewhat more
complicated definitions.

Consider now a practical example in which a quantity ε_obs is deduced from
conduction measurements within the temperature range 200° K. to 500° K.
using the expression

$$\sigma = \sigma_0 \exp(-\epsilon_{\text{obs}}/2kT). \tag{8}$$


We shall take φ̄ as 0.1 eV and s as 0.01 eV. For an impurity content
of 10!” centres/cm.? and a dielectric constant equal to 10, the establishment 
of a barrier of this height would require less than 101° trapped electrons per 
unit area. Taking into account the temperature dependence of α, which
would be controlled by the activation energy ε in the interior of the
semi-conductor, we obtain for the average value of the exponent

$$\epsilon_{\text{obs}} = \epsilon + \left[(\phi_{\text{eff}})_1 + (\phi_{\text{eff}})_2\right], \tag{9}$$

where the suffixes refer to the two temperatures. Substitution into (7)
shows that

(φ_eff)_1 = 0.103 eV (200° K.) and
(φ_eff)_2 = 0.101 eV (500° K.).
Hence ε_obs = 0.204 + ε eV (200° K. to 500° K.).

The evaluation of ε_obs thus involves an uncertainty of

$$2\left[(\phi_{\text{eff}})_1 - (\phi_{\text{eff}})_2\right]/\epsilon_{\text{obs}},$$

which is less than 2 per cent, the exact value depending on ε. This is quite
comparable with normal experimental accuracy for measurements of this
kind. Hence a standard deviation of as much as 10 per cent in the value
of φ would not normally result in any detectable departure from the
exponential form of the temperature dependence.
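The arithmetic of this example may be sketched as follows. The barrier values and temperatures are those quoted in the text; the value of Boltzmann's constant and the function names are assumptions introduced here for illustration, and the interior activation energy ε is left as a free argument because the text does not fix it.

```python
K_BOLTZMANN_EV = 8.617e-5  # Boltzmann's constant in eV/K (assumed value)


def phi_eff(phi_mean, s, temperature):
    """Effective barrier height, phi_mean + s^2 / (2 k T), in eV."""
    return phi_mean + s ** 2 / (2.0 * K_BOLTZMANN_EV * temperature)


phi_1 = phi_eff(0.1, 0.01, 200.0)   # lower end of the temperature range
phi_2 = phi_eff(0.1, 0.01, 500.0)   # upper end of the temperature range
barrier_part = phi_1 + phi_2        # barrier contribution to eps_obs


def relative_uncertainty(eps):
    """2 [(phi_eff)_1 - (phi_eff)_2] / eps_obs, with eps_obs = eps + barrier_part."""
    return 2.0 * (phi_1 - phi_2) / (eps + barrier_part)


print(round(phi_1, 3), round(phi_2, 3), round(barrier_part, 3))
print(relative_uncertainty(0.0))    # largest case, eps = 0
```

Since any positive ε enlarges the denominator, the case ε = 0 gives the upper bound of the uncertainty.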




§ 3. Discussion. 


It is a common observation that the activation energies measured on
different specimens of the same substance can vary between relatively
wide limits. Considering equations (7) and (9), it is plausible that the
value of φ_eff and its mean over a temperature range could vary from
specimen to specimen, depending on the precise conditions prevailing at
inter-granular contacts. On the other hand, the true activation energy ε
is expected to be a constant for a given substance, unless the impurity
content is very high. Variations of ε_obs in micro-crystalline semi-conductors
of low impurity content must, therefore, be ascribed to the existence of
inter-granular barriers. If the impurity content is very high, an alternative
interpretation may be available, based on the interaction between
neighbouring impurity centres. However, no satisfactory quantitative
theory on impurity interaction has so far been established (Seitz 1950).
In any case, such interaction would be expected to produce a whole
spectrum of activation energies, corresponding to random variation of the
distance between neighbouring centres. An averaging process similar
to that described in the previous section would thus be again required
to account for the observed exponential temperature dependence of
conductivity.

The above refers to activation energies deduced from conduction
measurements. Activation energies deduced from thermo-electric
measurements which do not involve current flow should be almost
independent of inter-granular barriers. In micro-crystalline specimens,
thermo-electric activation energies should thus be generally smaller than
ε_obs. The exact consequences of grain structure and inter-granular
barriers on the temperature dependence of the Hall coefficient still remain
to be ascertained.

If variations of ε_obs from specimen to specimen arise in single crystals of
low impurity content, an additional hypothesis would be required, though
it is one which is thought to be readily acceptable. It would be necessary
to assume that the concentration of impurity centres is subject to local
variations. This would produce localized space charges and fields in the
manner already described (Henisch 1948), no matter whether we are
dealing with a single crystal or with a granular medium. Any electron
traps occurring within the crystal would have similar consequences,
particularly if they occur in clusters. The barriers which impede the
passage of current carriers are then no longer "inter-granular", but a
treatment similar to that used above could again be applied in principle.
As in the previous case, we would be concerned with a whole spectrum of
barrier heights, and the value ε_obs resulting from their summation could
vary from specimen to specimen. The activation energy ε could then be
taken as a constant for the material. This treatment is also applicable
to deficit semi-conductors for which no satisfactory theory of
inter-granular barriers is at present available.




ACKNOWLEDGMENT. 


Grateful acknowledgment is made to Professor R. W. Ditchburn for 
his support of this research, and to him, Dr. C. A. Hogarth and Dr. T. B. 
Rymer for the help received during many stimulating discussions. 


REFERENCES. 


CHASMAR, R. P., 1948, Nature, Lond., 161, 281.

HENISCH, H. K., 1948, Electrical Communication, 25, 163; 1950, Proceedings of
the Conference on the Properties of Semi-Conducting Materials, University
of Reading (Butterworths 1951), p. 234.

MILLER, P. H., 1950, Proceedings of the Conference on the Properties of
Semi-Conducting Materials, University of Reading (Butterworths 1951), p. 172.

MOTT, N. F., 1950, Proceedings of the Conference on the Properties of
Semi-Conducting Materials, University of Reading (Butterworths 1951), p. 1.

SEITZ, F., 1950, Proceedings of the Conference on the Properties of Semi-Conducting
Materials, University of Reading (Butterworths 1951), p. 8.

VERWEY, E. J. W., 1950, Proceedings of the Conference on the Properties of
Semi-Conducting Materials, University of Reading (Butterworths 1951), p. 151.

VOLGER, J., 1950, Proceedings of the Conference on the Properties of
Semi-Conducting Materials, University of Reading (Butterworths 1951), p. 162.


[ 739 ]


LXXV. The Moving Griffith Crack. 


By ELIZABETH H. YOFFE, Ph.D. *


[Received February 28, 1951.] 


SUMMARY. 


The stress field is calculated about a straight crack moving through
an elastic medium. The stresses depend on the velocity and reduce to
Inglis' solution when the velocity is zero. The results may be applied
to the spicular fracture of glass.


INTRODUCTION. 


THIS problem was proposed by Dr. Orowan in connection with the
spicular fracture of glass and other materials. It is hoped that he will 
shortly publish a full account of its applications both to brittle fracture 
and to the multiplication of dislocations in a strained crystal. In the 
meantime the mathematical solution is given here for the stress field 
around a crack propagating with a finite velocity, and the stresses near 
the head of the crack are compared with those of the static case. The 
known stresses about a stationary crack show that fracture may be 
expected to occur in the line of the crack, but Dr. Orowan anticipated 
that at high velocities there might be a tendency to turn out of this line, 
giving a curved or branching crack. It was assumed that fracture would 
occur normal to the maximum tensile stress, and the stresses are therefore 
calculated at points on a circle of small radius about the head of the crack. 


§ 1.

The problem is to find the stresses near the head of a rapidly propagating 
crack in a plate of isotropic elastic material. The crack has the form of 
a straight narrow slit, and may be regarded as the limit of an ellipse of 
semi-axes a, b where b approaches zero. If a sufficiently large transverse 
tension T is applied to the plate, the crack may be expected to spread in 
both directions along its length. In this treatment it has been assumed 
for simplicity that the crack propagates to the right only, remaining of 
constant length 2a. This is justified by the fact that the stress 
distribution close to one end of the crack is not influenced by its distance 
from the other end, as is shown later. 

By keeping the crack of constant length the problem becomes that of a
plate in uniform tension, across which a disturbance passes at a constant 
speed and without change of form. This disturbance is represented by 
a system of elastic waves, from which the solution is obtained. 

The method used is very similar to that of Eshelby (1949) in his 


investigation of a moving dislocation. The same form of solution is 
* Communicated by Sir Lawrence Bragg. 




obtained by representing the moving disturbance by two sets of surface 
waves, but there are some differences in the treatment of the boundary 
conditions. 


§ 2. 
The boundary conditions for a crack extending along the x-axis from
x=−a to x=a across an otherwise uniform tension field are

$$\widehat{xx}=0,\quad \widehat{yy}=T,\quad \widehat{xy}=0 \quad \text{at infinity},$$
$$\widehat{yy}=0=\widehat{xy} \quad \text{on } y=0,\ -a<x<a.$$

The uniform tension is removed in order to consider separately the
moving part of the system. This has the boundary conditions

$$\widehat{yy}=-T,\quad \widehat{xy}=0 \quad \text{on } y=0,\ -a<x<a,$$

and all stresses zero at infinity.

The solution must be symmetrical with respect to the x-axis, and it is 
therefore sufficient to consider only the upper half-plane y>0, provided 
continuity is maintained at the axis, outside the crack. Let u, v be
components of displacement in the x, y directions, and let

$$x' = x - ct,$$

where c is the velocity of propagation and t the time. Then a dynamic
elastic system is to be determined throughout y>0, satisfying the
boundary conditions

$$\begin{aligned} \widehat{yy} &= -T && \text{on } y=0,\ -a<x'<a,\\ \widehat{xy} &= 0 && \text{on } y=0,\ \text{all } x',\\ v &= 0 && \text{on } y=0,\ |x'|>a, \end{aligned} \tag{1}$$

and all stresses zero at infinity.


§ 3. 


In accordance with the general theory given by Love (1920), elastic
waves may travel with velocity c in the x direction at the surface of the
half space y>0, without displacement in a direction normal to the
x, y plane, provided they are composed of displacements of the two types

$$\begin{aligned} u_1 &= A_1 \exp(-\gamma s y)\sin sx', & v_1 &= A_1\gamma \exp(-\gamma s y)\cos sx',\\ u_2 &= B_1\beta \exp(-\beta s y)\sin sx', & v_2 &= B_1 \exp(-\beta s y)\cos sx', \end{aligned} \tag{2}$$

where

$$\gamma^2 = 1 - c^2/c_1^2,\qquad \beta^2 = 1 - c^2/c_2^2,$$

A_1, B_1 are arbitrary constants, and c_1, c_2 are the velocities of propagation
of longitudinal and shear waves,

$$\rho c_1^2 = \lambda + 2\mu,\qquad \rho c_2^2 = \mu, \tag{3}$$

where λ, μ are Lamé's elastic constants and ρ is the density.
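The two decay factors depend only on the crack velocity and the two wave velocities, so they are easily tabulated. The sketch below is an illustration only: the function names are hypothetical, and the restriction to velocities below the shear-wave velocity is an assumption made here so that both factors stay real.

```python
import math


def wave_speeds(lam, mu, rho):
    """Longitudinal and shear wave velocities from the Lame constants and density."""
    c1 = math.sqrt((lam + 2.0 * mu) / rho)
    c2 = math.sqrt(mu / rho)
    return c1, c2


def decay_factors(c, c1, c2):
    """Return (gamma, beta) with gamma^2 = 1 - c^2/c1^2 and beta^2 = 1 - c^2/c2^2."""
    if not 0.0 <= c < c2:
        raise ValueError("this sketch assumes 0 <= c < c2 so that beta is real")
    gamma = math.sqrt(1.0 - (c / c1) ** 2)
    beta = math.sqrt(1.0 - (c / c2) ** 2)
    return gamma, beta


# Example in arbitrary units with lam = mu, which corresponds to Poisson's ratio 1/4.
c1, c2 = wave_speeds(1.0, 1.0, 1.0)
gamma, beta = decay_factors(0.6 * c2, c1, c2)
print(c1 / c2, gamma, beta)
```

Because c_1 exceeds c_2, γ is always the larger of the two factors, and both tend to unity as the velocity tends to zero.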
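Displacements such as (2) decrease rapidly to zero as y increases,
provided s is positive, and so represent a local disturbance near the x-axis.
Try a solution of the form

$$u = u_1 + u_2,\qquad v = v_1 + v_2.$$

The shear stress is to vanish on the x-axis, and is given by

$$\widehat{xy} = \mu\left(\frac{\partial u}{\partial y} + \frac{\partial v}{\partial x}\right) = -\mu s\left[2\gamma A_1 \exp(-\gamma s y) + (1+\beta^2) B_1 \exp(-\beta s y)\right]\sin sx',$$

which is zero when y=0 if

$$B_1 = -\frac{2\gamma}{1+\beta^2}\,A_1.$$

Replacing B_1 by this expression, and integrating for all positive values
of s gives a general form for the moving disturbance,

$$\begin{aligned} u &= U_1 + U_2 = \int_0^\infty A(s)\left[\exp(-\gamma s y) - \frac{2\beta\gamma}{1+\beta^2}\exp(-\beta s y)\right]\sin sx'\,ds,\\ v &= V_1 + V_2 = \int_0^\infty \gamma A(s)\left[\exp(-\gamma s y) - \frac{2}{1+\beta^2}\exp(-\beta s y)\right]\cos sx'\,ds. \end{aligned} \tag{4}$$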


These displacements are not biharmonic functions of x and y as in static
elasticity, but harmonic functions of the variables x′, βy and γy as shown
by the relations

$$V_1 + i\gamma U_1 = f(x' + i\gamma y) = f(z_1),\qquad \beta V_2 + iU_2 = g(x' + i\beta y) = g(z_2), \tag{5}$$

where f, g are analytic functions of the complex variables z_1, z_2.
The stress components are given by 


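Differentiating equations (5) gives

$$f'(z_1) = \frac{\partial V_1}{\partial x'} + i\gamma\frac{\partial U_1}{\partial x'} = \frac{\partial U_1}{\partial y} - \frac{i}{\gamma}\frac{\partial V_1}{\partial y} = P + iQ,$$
$$g'(z_2) = \beta\frac{\partial V_2}{\partial x'} + i\frac{\partial U_2}{\partial x'} = \frac{1}{\beta}\frac{\partial U_2}{\partial y} - i\frac{\partial V_2}{\partial y} = R + iS,$$

where P, Q, R, S are real functions. The stresses may then be expressed as

$$\begin{aligned} \widehat{xx} &= \left(\frac{\lambda+2\mu}{\gamma} - \lambda\gamma\right)Q + 2\mu S,\\ \widehat{yy} &= \left(\frac{\lambda}{\gamma} - (\lambda+2\mu)\gamma\right)Q - 2\mu S,\\ \widehat{xy} &= 2\mu P + \mu\left(\frac{1}{\beta} + \beta\right)R. \end{aligned} \tag{6}$$

By (4) the functions f, g in this problem differ only by a constant factor
when referred to the same argument,

$$g(z) = -\frac{2\beta}{1+\beta^2}\,f(z). \tag{7}$$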


§ 4. 


The problem is now to determine the unknown function A(s) in (4)
so as to satisfy the boundary conditions (1). Direct substitution in the
conditions

$$\widehat{yy} = -T \quad (-a<x'<a),\qquad v = 0 \quad (|x'|>a)$$

would give equations of the form

$$\int_0^\infty sA(s)\cos sx'\,ds = \text{constant} \quad (-a<x'<a),$$
$$\int_0^\infty A(s)\cos sx'\,ds = 0 \quad (|x'|>a).$$

It is clear that more convenient equations would be obtained if the
boundary conditions determined either $\widehat{yy}$ or v for all x′, instead of part one
and part the other. Such equations are obtained by reasoning from the
static case.


The static stress field due to an elliptical hole in a body under uniform 
tension was calculated by Inglis (1913) with the straight narrow crack 
as a particular case. Removing the uniform tension field and changing 
from elliptic to cartesian coordinates, his solution gives the following 
displacements and stresses on the x-axis :


x (|u|<a) 
an 
aap eave ani a) 
at / (x? ) (1x<—a), 
_TA+%) | Vi@—a) (|e |<a) (8) 
~ yA+H) | 0 (|x|>a), 
(—T (| |<a) 
at=yy== 4 x 
(Gea!) (eee) | 
ay=0 


where in each case the positive square root is to be taken. 
Westergaard (1939) has given this solution in a more convenient form 


HZ —yInZ’, ) 
yy=NZA-yImZ’, ae et ts oe (9) 


xy=—yRtIZ’. 
where 


yA nh 2 
be dz /(2—a*) (1- =z) : 


2=a-+1y. 


When the crack moves some of these boundary values must change.
For, from (5) and (7) 


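$$V_1 + i\gamma U_1 = f(x'),\qquad \beta V_2 + iU_2 = -\frac{2\beta}{1+\beta^2}\,f(x') \qquad \text{on } y=0,$$

and therefore

$$u = \operatorname{Im} f(x')\left(\frac{1}{\gamma} - \frac{2\beta}{1+\beta^2}\right),$$
$$v = -\operatorname{Re} f(x')\left(\frac{2}{1+\beta^2} - 1\right),$$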


and from (6) 
A 
a= (= — 0-24) Q-2 


A 4B \ oO 


0 / 
=H [Imf(e')], 


where 


=— = (4 24)74 ; ae 
So if 

iT (a'—V/(e'—a?)) 
2(A+#) [(1/y)—28/(1+-B*)] ° 
(the real square root being taken with the sign of x’) then wu has the same 
value as in the static case, but v and 7 are multiplied by new factors : 


— 2B. platpy—1y [Ve—2?) ( |x’ |<a) 
2(A+#) [1/y—28/(1+-B?)] 


fla"}=— 


0 (|< |=a); 

qa oe | es yeas 
ra GYP nO) bel eT x ; 

2(A+e) [1/y—28/(1+B")] Teas —} (|x |>a). 


Alternatively f(x’) may be so chosen that v, or yy, keeps its static value, 

when w and yy, or wu and v must be multiplied by factors of a similar 

nature. Therefore since v remains zero for | x’ |>a whatever the factor, 

it appears that the boundary conditions (1) may be satisfied by choosing 

yy to have its static value along all the x’-axis. That is, on y=0, 
fle)=—F (e'— Vea?) 


—T |x’ |<a 
T(Seacay —}) |x’ |>a 
=, tt (TR Nl (| 2’ |<a) 
i a(; +R) Ciatane ) (at|>a) 
( 
(| @ 


_T/ 2 _,\{v@t—2) 
=a 0 


With any one of equations (10) in place of the boundary conditions (1) 
the problem is readily soluble. 


(10) 


| x’ |<a) 
) 


|e |Se 




§ 5. 


On y=0, from (4) and (10), 


v= — J. yA(s) (x -1) cos sx’ ds 
Es at 2a) V(a—a?) (| 2 | <a) 
H\1-+-f° 0 (| x’ |>a). 


The cosine transform of this function is given by Titchmarsh (1937) as 
a particular case of the functions 


(I—a2y—¥2 (0<a<1) 
2D y+ da-"I (zx) 


0 (x>1) 
where J, is the Bessel function of order v. This gives 
_ Ta J,(as) 
AU me meee (11) 


With A(s) determined, the values of wu and v may be calculated from (4) 
for all points in the upper half plane. For it is known (Watson 1922) 
that 
ie exp (~pi)Ja(qt) 4, V(p'+9")—p 
0 t q 
for Rf(p+iq)>0 and R{(p—iq)>0, so that 


V,t%0,= I. yA(s) exp (—ysy+isz’) ds, 


vy la (Pexp (22,8)J1(a8) 5 
=|, s 2 
T; 
= qa-va-@) 
=f(21) tOlgy Ue 0A tear, 1 in (12) 
and similarly 
2BT% 
BV.+1U, a (22 a/(%—a?)), 
=9 (22) 
Then 4 
z 
fe)=— 3 (1- aaa) | 
one ; L (13) 
iG ps iM ps pS 
eam am (| eaa) | 
=R+i8, 


and the stresses are given by (6). 




The problem is now solved since a system of surface waves has been 
found which satisfies the boundary conditions (1). It is easily verified 
that each stress component tends to zero at infinity. 


§ 6. 

To show that the solution remains correct at zero velocity it is necessary 
to take the limit as c → 0.

For c small


. ce 
PIS GP 
. Cc 
jean 202” 
4B ie 
ae 
C2 
He sia) ee 
Then 
—1ic?T 
f@+= Owe (2 9/(27=q2)); 


ne 
fla)=f'2— xh (2) 
, / Jira vt 
9/(@)=—-f@t saul") 
=R+718 


and the stresses are given approximately by 
eB (a+ =) Q+2uS8, 


C2 
I= — (4-040 =) Q—228, 
1 
xy=+2u(P+R). 


On substituting, and retaining only the terms of zero order in c2, the 
stresses at zero velocity become : 


2 
R=—wyS (1-3) REF!" (z) +( +n) S Inf (2), 
2 
=H 5 (1 4 +) Re'"(2) +0+H) 5 5 Inf (z), 


an C 
aw (1-9) 
il 2. 


Since 1 − c₁²/c₂² = −(λ+μ)/μ, and f′(z) = ic₁²Z/(λ+μ)c², these expressions are
in agreement with (9) and the solution is therefore correct at zero velocity.
To show that the solution is unique, assume two different solutions
satisfying the same equations and boundary conditions, and specified
by functions f_1(z), f_2(z), say. Let the difference of these be h(z). Then
since the equations are linear the difference in stress of the two solutions
is obtained by replacing f by h in the above equations. But at zero
velocity these stresses are specified, so the differences must be zero, giving

$$\operatorname{Im}(h''(z)) = 0 = \operatorname{Re}(h''(z)),$$
$$\operatorname{Im}(h'(z)) = 0$$

at all points. Also, from the boundary conditions,

$$h'(z) = 0 \quad \text{at infinity}$$

and

$$\operatorname{Re}(h(z)) = 0 \quad \text{on } y=0,\ |x|>a.$$

So h(z) is at most a pure imaginary constant, which can only add a
uniform u-displacement without affecting the stresses. The solution
is therefore unique.


§ 7.

The next step is to calculate the direct stress acting across a radius
from the point x=a, at points very close to this centre but at various
positions about it. If polar coordinates (r, θ) are taken with centre
x=a, y=0, such that

$$x - a = r\cos\theta,\qquad y = r\sin\theta,$$

then the stress $\widehat{\theta\theta}$ is to be calculated for a small constant value of r, and
θ varying from 0 to π/2.
Let

$$z = a + re^{i\theta} = a(1 + ke^{i\theta}),$$

where k is a small constant. Then

$$z_1 = a(1 + k_1 e^{i\theta_1}),\qquad z_2 = a(1 + k_2 e^{i\theta_2}),$$

where

$$\begin{aligned} k_1 &= k(\cos^2\theta + \gamma^2\sin^2\theta)^{1/2}, & \tan\theta_1 &= \gamma\tan\theta,\\ k_2 &= k(\cos^2\theta + \beta^2\sin^2\theta)^{1/2}, & \tan\theta_2 &= \beta\tan\theta. \end{aligned} \tag{14}$$


As k approaches zero, higher powers of k may be neglected in 
comparison with the first, and the following approximations are valid : 


PHiQ=—¥ (1-7) 


Ti (1 __ exp so) 
pet Vv (2k) 


T Caine 0, 
~ HV Qh) (sing at ee} 
an 2B i ML ora 3) 
R+i8= cs (1-+ B®) Hy/(2k,) (sin 2 +2 cos 2): 
The stress is given by 
00=22 sin? 0-+-7y cos? 0—2%y sin 6 cos 6 
rasp (- -y) Onn (- +7) Q cos 26—2yS cos 26 
id ee 


1 


B 


- rca (4»(9-9) -oG-9 9) 


—2u sin 26 sin 3) ] i (cos? 6+-y? sin? 6)! 


—2uP sin 26—p ( +8) R sin 26, 


+ E cos 26 boa 


2B 
1+ 2 


+p (; +8) sin 20 sin ll / (cos? 0+? sin? ay} : 


(15) 


with k appearing only in the constant factor. Since a, the half length of
the crack, appears nowhere else in (15) it cannot affect the distribution
with θ of the stress $\widehat{\theta\theta}$, but only its absolute magnitude. Therefore for the
purpose of this calculation the actual length of the crack is irrelevant,
as mentioned earlier.

The equation (15) has been used to calculate $\widehat{\theta\theta}$ at various velocities
and positions, as shown in Table I. It was assumed that λ=μ, giving
Poisson's ratio ¼. The limiting form of (15) when the velocity approaches
zero is

$$\widehat{\theta\theta} = \frac{T}{\sqrt{2k}}\cos^3\frac{\theta}{2},$$


which is in agreement with Inglis and Westergaard. As the velocity
increases the curve for $\widehat{\theta\theta}$ as a function of θ first flattens and finally shows
a definite maximum at a considerable angle from the x-axis.
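For comparison with the moving case, the static angular distribution can be tabulated directly. The sketch below covers only the zero-velocity limit, taken in the standard near-tip form T cos³(θ/2)/√(2k); it does not implement the velocity-dependent expression (15). The function name and the values chosen for T and k are illustrative assumptions.

```python
import math


def static_hoop_stress(theta, tension=1.0, k=0.01):
    """Zero-velocity limit of the stress across a radius at angle theta (radians).

    tension is the applied stress T and k = r/a is the small relative distance
    from the head of the crack; both defaults are arbitrary.
    """
    return tension / math.sqrt(2.0 * k) * math.cos(theta / 2.0) ** 3


angles_deg = list(range(0, 91, 10))
values = [static_hoop_stress(math.radians(d)) for d in angles_deg]
for d, v in zip(angles_deg, values):
    print(d, round(v, 4))
```

In this limit the stress falls steadily as θ increases from zero, so that the maximum lies in the line of the crack.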


TABLE I. 


[Fig. 1. Abscissa θ from 0° to 90°.]


§ 8. CONCLUSION. 


From these results it is to be expected that, if the material is such
that a crack propagates in a direction normal to the maximum tensile
stress, there is a critical velocity of about 0.6 c₂ at which the crack tends
to become curved. At a lower velocity the crack extends in a straight
line, but as the speed increases the $\widehat{\theta\theta}$ curve becomes flat and the crack
may form branches, since nearly equal stresses exist over a wide arc
about the head of the crack. At higher velocities the $\widehat{\theta\theta}$ curve has a
definite maximum again, and each branch tends to curve, as is often seen
in cracked glass. These effects have been elegantly demonstrated by
Dr. Orowan in sheets of cellophane, and he is expected to discuss them
more fully elsewhere.


ACKNOWLEDGMENT. 


My thanks are due to Dr. Orowan for many helpful discussions and 
to the British Iron and Steel Research Association for a grant. 


REFERENCES. 


ESHELBY, J. D., 1949, Proc. Phys. Soc. A, 62, 307.

INGLIS, C. E., 1913, Trans. Inst. Naval Arch., 219.

LOVE, A. E. H., 1920, Elasticity (Cambridge: University Press), p. 312.

TITCHMARSH, E. C., 1937, Fourier Series and Integrals (Oxford: University
Press), p. 178.

WATSON, G. N., 1922, Bessel Functions (Cambridge: University Press), p. 386.

WESTERGAARD, H. M., 1939, J. App. Mech., 6A, 49.


[ 751 ]


LXXVI. Total Cross-Sections of the Elements for 156 MeV. Neutrons. 


By A. E. TAYLOR, T. G. PICKAVANCE, J. M. CASSELS and T. C. RANDLE.
Atomic Energy Research Establishment, Harwell*. 


[Received April 9, 1951.] 


ABSTRACT. 


Measurements have been made of the total cross-sections of 10 elements 
for neutrons of effective energy 156±3 MeV. The results are discussed
in the light of the theory of Fernbach, Serber and Taylor (1949). 


§ 1. INTRODUCTION.


MEASUREMENTS of the total cross-sections of hydrogen and carbon for 
neutrons of 156±3 MeV. have already been reported (Taylor, Pickavance,
Cassels, and Randle 1951). A wider range of elements has since been 
investigated using the same apparatus and geometrical arrangement as 
before. Only the details of the attenuators themselves need to be 
described here. 


§ 2. ATTENUATORS.


Attenuators of metallic elements were machined to 23 inches diameter, 
while attenuators in powder or liquid form were contained in tubes of 
24 inches internal diameter fitted with thin copper end windows. 
Corrections were applied to the results for these copper windows, and in 
one case (lead) for a small amount of impurity. 

Using the previously determined value of the hydrogen cross-section, 
the deuterium cross-section was obtained from the difference between the 
attenuation produced by a heavy water sample and a sample of distilled 
water containing the same number of oxygen atoms. The same measurements
were also used to determine the oxygen cross-section. For the
beryllium cross-section, the attenuation of the neutron beam by anhydrous 
powdered beryllia was measured, and allowance made for the oxygen atoms. 
In a similar way anhydrous barium carbonate was used to determine the 
barium cross-section. 

The lengths of the attenuators and the geometrical arrangement were 
such that only small corrections had to be applied for elastically scattered 
neutrons which reached the detector (Placzek and Bethe 1940). These 
corrections were 0-6 per cent of the cross-sections of lead and barium 


* Communicated by the Authors. 




carbonate, and less than 0-3 per cent for all the other attenuators. The 
attenuators varied in length from one to three times the mean free path 
of the 156 MeV. neutrons in the attenuator material. 


§ 3. RESULTS. 


The results of the measurements are contained in the Table, the last 
column of which records the length of attenuators employed. 


TABLE
Neutron total cross-sections at 156±3 MeV.

Element      Total cross-section    Collision radius            Attenuator length
             σ_t × 10²⁴ (cm.²)      r=(σ_t/2π)^½ × 10¹³ (cm.)   (gm. cm.⁻²)

Hydrogen     0.0464 ± 0.0012        0.859                       as (CH₂)ₙ, 136.93
Deuterium    0.0707 ± 0.002         1.061                       as D₂O, 185.1
Beryllium    0.258 ± 0.014          2.02                        as BeO, 43.66
Carbon       0.330 ± 0.003          2.26                        120.23
Oxygen       0.430 ± 0.004          ...                         ...
Aluminium    ... ± 0.011            ...                         ...
Iron         ... ± 0.016            ...                         ...
Copper       1.376 ± 0.018          ...                         ...
Barium       2.476 ± 0.088          ...                         ...
Lead         3.499 ± 0.026          ...                         156.62
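Two of the tabulated quantities follow from the cross-section by simple arithmetic, and a short sketch may make the relations explicit. It assumes the collision radius r=(σ_t/2π)^½ and a mean free path, in gm. cm.⁻², equal to the atomic weight divided by Avogadro's number times σ_t. The function names, Avogadro's number and the atomic weight taken for lead are assumptions introduced here and are not given in the text.

```python
import math

AVOGADRO = 6.022e23  # atoms per gram-atom (assumed value)


def collision_radius_cm(sigma_cm2):
    """Collision radius r = (sigma_t / 2 pi)^(1/2), in cm."""
    return math.sqrt(sigma_cm2 / (2.0 * math.pi))


def mean_free_path_g_cm2(sigma_cm2, atomic_weight):
    """Mean free path expressed as a mass thickness, in g/cm^2."""
    return atomic_weight / (AVOGADRO * sigma_cm2)


# Hydrogen and deuterium cross-sections from the Table (units of cm^2).
r_hydrogen = collision_radius_cm(0.0464e-24)
r_deuterium = collision_radius_cm(0.0707e-24)

# Lead: cross-section and attenuator length from the Table; atomic weight assumed.
mfp_lead = mean_free_path_g_cm2(3.499e-24, 207.2)
lead_length_in_mfp = 156.62 / mfp_lead

print(r_hydrogen * 1e13, r_deuterium * 1e13, lead_length_in_mfp)
```

On these assumptions the lead attenuator comes out at rather more than one and a half mean free paths, consistent with the stated range of one to three.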


§ 4. Discussion. 


Fernbach, Serber, and Taylor (1949) have investigated theoretically the 
total cross-sections of the elements for high energy neutrons. The various 
nuclei are regarded as spheres of refractive and absorbent material which 
distort the wave functions representing the high energy neutrons. There 
are three parameters entering into the problem :— 


(i.) the nuclear radius R, which is here taken equal to 1.37 A^{1/3} × 10⁻¹³
cm., A being the atomic weight of a nucleus whose charge is Ze.

(ii.) the absorption coefficient K for high energy neutrons inside the
nucleus. The value of K is calculated from σ_np and σ_nn, the free
neutron-proton and neutron-neutron cross-sections,

$$K = \frac{3}{4\pi R^3}\left[Z\alpha_{np}\sigma_{np} + (A-Z)\alpha_{nn}\sigma_{nn}\right],$$

where α_np and α_nn are numbers smaller than unity which take into account
the fact that some collisions inside the nucleus are forbidden by the
Exclusion Principle (Goldberger 1948). Because neutron-proton
scattering is large in both forward and backward directions, α_np is expected
to be smaller than α_nn.

(iii.) the average potential energy V of the neutron inside the nucleus.
For the moment V will be regarded as an adjustable parameter.


Fig. 1.

Experimental and theoretical values of w as a function of R,
for 156 MeV. neutrons.

w = σ_t/2πR², R = 1.37 A^{1/3} × 10⁻¹³ cm.


In fig. 1 are plotted the experimental "specific total cross-sections",
w=σ_t/2πR², of the elements at 156 MeV. neutron energy. Theoretical
values are also shown for V=0 and 10 MeV., the other constants being
taken as α_np=0.72, α_nn=0.85, σ_np=46×10⁻²⁷ cm.², and σ_nn=57×10⁻²⁷
cm.². The figures for α_nn and σ_nn have been obtained by supposing the
neutron-neutron and proton-proton interactions to be equal.


Since changes in w are proportional to V² in the neighbourhood of V=0,
a very good fit to the data could obviously be obtained by taking V~6 MeV. 
This is smaller than the normal Hartree potential of 30 MeV. which is 
required to confine the nucleons inside a nucleus. The difference is not 
altogether surprising. The Hartree potential is an average over both 
near and distant encounters between nucleons, and in the total cross- 
section calculations near collisions have already been taken into account 
through K. 

The overall situation at high neutron energies is not, however, so 
satisfactory as the discussion so far might seem to indicate. The results 
at 275 MeV. (Fox, Leith, Wouters and MacKenzie 1950, DeJuren 1950) 


are compared in fig. 2 with theoretical values of w for V=0, α_np=0.83,
α_nn=0.92, σ_np=35×10⁻²⁷ cm.², σ_nn=50×10⁻²⁷ cm.². There is a
considerable discrepancy, and this cannot be resolved by adjustment of V
since w is a minimum for V=0.

Fig. 2.

Experimental and theoretical values of w as a function of R, for 275 MeV.
neutrons. The experimental errors quoted for σ_t (Fox, Leith, Wouters
and MacKenzie 1950, DeJuren 1950) are in most cases smaller than the
diameters of the circles.

The most obvious means of escape is to reduce o,, to about 
27 10-*7 em.?, although even then the fit is not perfect. The situation 
here hardly provides strong evidence that the proton-proton and neutron- 
Pte ay cael not equal at high energies, but there is certainly 




ACKNOWLEDGMENTS. 


We wish to thank Mr. E. Wood for his valuable assistance in carrying 
out the measurements, and the cyclotron crew for their cooperation at all 
times. 

This paper is published by permission of the Director of the Atomic 
Energy Research Establishment, Harwell. 


REFERENCES. 


DEJUREN, J., 1950, Phys. Rev., 80, 27.

FERNBACH, S., SERBER, R., and TAYLOR, T. B., 1949, Phys. Rev., 75, 1352.

FOX, LEITH, WOUTERS and MACKENZIE, 1950, Phys. Rev., 80, 23.

GOLDBERGER, M. L., 1948, Phys. Rev., 74, 1269.

PLACZEK, G., and BETHE, H. A., 1940, Phys. Rev., 57, 1075A.

TAYLOR, A. E., PICKAVANCE, T. G., CASSELS, J. M., and RANDLE, T. C., 1951,
Phil. Mag., 42, 20.


LXXVII. Metallic Conduction—The “ Internal Size-Effect ”. 


By D. K. C. MACDONALD,


Clarendon Laboratory, Oxford*. 


[Received April 12, 1951.] 


ABSTRACT. 


The conception as an “ internal size-effect ’’ of internal boundaries in 
a metal is proposed and discussed in relation to a number of problems. 
The behaviour of the electrical resistance under a magnetic field is of 
particular interest and it is suggested that this may be relevant to the 
anomalous effects observed in gold (and other metals) at low temperatures 
and to the peculiar magneto-resistive characteristics of ferromagnetics. 


§ 1. INTRODUCTION.


The limitation of electron mean free path in metals due to the physical
size of the specimen is now well-known. A first approximate treatment 
for a thin “film” or plate was made by J. J. Thomson (1901) and more 
recently the analysis on the modern statistical theory of metals was given 
by Fuchs (1938) and for wires by Dingle (1950), Chambers (1950) and 
MacDonald and Sarginson (1950). 

It is however clear to-day that the internal physical structure of a metal 
is not entirely uniform and that grain, “‘mosaic”’ or dislocation boundaries 
will add to the electrical resistance as well as lattice vibrations and 
chemical impurity centres. It has been customary to regard such physical 
‘ defects ’ as simply contributing more or less uniformly to the electron- 
scattering without requiring any very specialized treatment. Recently, 
however, it has been found experimentally (MacDonald 1949) and shown 
theoretically (MacDonald and Sarginson (loc. cit.), Sondheimer (1950)) 
that the resistance-behaviour of a specimen limited by its external 
geometrical size is ‘‘ anomalous ” in that the resistance may be decreased 
by the application of a magnetic field. This occurs essentially when the 
size of the intrinsic electron orbit becomes comparable with the physical 
dimensions of the specimen. This then suggests that it may be valuable 
to consider the influence of internal boundaries in an analogous manner. 
Specifically, we propose that this may lead to an explanation for the peculiar 
behaviour of the resistance and magneto-resistance of gold and other 
metals at low temperatures and also for the anomalous magneto-resistance 
of ferro-magnetic metals. It also appears of possible importance in the 
interpretation of deviations from Matthiessen’s Rule which postulates 
the independent additivity of electrical resistivity components due to 
thermal and impurity scattering (in the general sense). 


* Communicated by the Author. 




§ 2. INTERNAL SIZE-EFFECT WITHOUT A MAGNETIC FIELD.


Let us consider a “large” specimen of metal which has internal
boundaries in the plane of current flow, partially effective in scattering
electrons, which form layers of separation $d$, and let $l_0$ be the mean free
path of electrons due to scattering in the “body” of the metal.

Let the $x$-axis be in the direction of the applied electric field, $E$, and
the $z$-axis perpendicular to the plane of the “layers”. Let $\alpha$ be the
fraction of electrons penetrating a layer without scattering, and $\beta$ the
fraction which are elastically scattered, the remainder being diffusely
scattered.

The appropriate Boltzmann (transport) equation for the perturbation, $\chi$,
to the electron distribution function is then:

$$v_z\frac{\partial\chi}{\partial z}+\frac{\chi}{\tau}=\frac{eE}{m}\frac{\partial N_0}{\partial v_x}, \qquad (1)$$

where $\tau$ is the relaxation time corresponding to $l_0$, and $N_0$ is the unperturbed
distribution. By the nature of the problem it is evident that
the solution will be periodic in $z$, of period $d$, and we need only consider
any one particular layer taking $z=0$ at the lower plane.

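The general solution of (1) (cf. Fuchs loc. cit.*) may be taken as:

$$\chi=\frac{eE\tau}{m}\frac{\partial N_0}{\partial v_x}\left[1+\phi(v_z)\exp(-z/\tau v_z)\right],$$

where $\phi$ is an arbitrary function.

From symmetry it is then evident that $\chi$ can be expressed:

$$\chi=\frac{eE\tau}{m}\frac{\partial N_0}{\partial v_x}\left[1+\phi(v_z)\exp(-z/\tau v_z)\right],\qquad v_z>0,$$

$$\chi=\frac{eE\tau}{m}\frac{\partial N_0}{\partial v_x}\left[1+\phi(-v_z)\exp((d-z)/\tau v_z)\right],\qquad v_z<0.$$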


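If we now consider the electrons leaving $z=0$ with $v_z>0$ it is clear
from our boundary conditions that we may write:

$$\frac{eE\tau}{m}\frac{\partial N_0}{\partial v_x}\left(1+\phi(v_z)\right)=\alpha\left[\frac{eE\tau}{m}\frac{\partial N_0}{\partial v_x}\left(1+\phi(v_z)\exp(-d/\tau v_z)\right)\right]+\beta\left[\frac{eE\tau}{m}\frac{\partial N_0}{\partial v_x}\left(1+\phi(v_z)\exp(-d/\tau v_z)\right)\right],$$

the first term on the right corresponding to electrons which have been
transmitted “upwards” from the layer beneath and the second term to
those which have been reflected at $z=0$ with reversal of the $z$ component
of velocity.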

* In fact Fuchs writes, more generally, $\phi(v)$ but this is unnecessary in this
type of problem.




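Thus, setting $p=\alpha+\beta$ we have, solving for $\phi(v_z)$:

$$\chi=\frac{eE\tau}{m}\frac{\partial N_0}{\partial v_x}\left[1-\frac{1-p}{1-p\exp(-d/\tau v_z)}\exp(-z/\tau v_z)\right],\qquad v_z>0,$$

$$\chi=\frac{eE\tau}{m}\frac{\partial N_0}{\partial v_x}\left[1-\frac{1-p}{1-p\exp(d/\tau v_z)}\exp((d-z)/\tau v_z)\right],\qquad v_z<0. \qquad (2)$$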
This is then just Fuchs' solution for a thin film if we replace his probability
of elastic scattering at the surface ($\epsilon$) by $p$, the sum of the probabilities
of penetration and elastic scattering, as is then evident physically.
Consequently the relative conductivity $\sigma/\sigma_0$, where $\sigma_0$ is the conductivity
of the metal in the absence of such striation layers, is given by the curves
computed by Fuchs (fig. 2, loc. cit.) setting $\epsilon=p$.
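
The algebra leading from the boundary condition to (2) is not written out in the text. A short sketch, using only the symbols already defined and assuming the boundary condition in the form given above, runs as follows. The common factor $(eE\tau/m)\,\partial N_0/\partial v_x$ cancels, leaving:

```latex
\begin{align*}
1+\phi(v_z) &= (\alpha+\beta)\left[1+\phi(v_z)\,e^{-d/\tau v_z}\right]
             = p\left[1+\phi(v_z)\,e^{-d/\tau v_z}\right],\\
\phi(v_z)\left[1-p\,e^{-d/\tau v_z}\right] &= -(1-p),\\
\phi(v_z) &= -\frac{1-p}{1-p\,e^{-d/\tau v_z}}, \qquad v_z>0 .
\end{align*}
```

Substituting this value of $\phi$ in the general solution gives the expression for $v_z>0$; the expression for $v_z<0$ follows on replacing $z$ by $d-z$ and $v_z$ by $-v_z$. For $p=1$ the function $\phi$ vanishes and the layers have no effect, as would be expected physically.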
It is of interest to note that we have then here a case where it is not
possible to determine universally the effective resistivity of the specimen
by a simple addition of inverse “mean free paths”. Thus for $d\gg l_0$
we find:

$$\frac{\sigma}{\sigma_0}\simeq 1-\frac{3(1-p)}{8}\,\frac{l_0}{d},$$

indicative of an effective mean free path, $\dfrac{8d}{3(1-p)}$, due to the layer
structure; while for $d\ll l_0$:

$$\frac{\sigma}{\sigma_0}\simeq\frac{3}{4}\,\frac{1+p}{1-p}\,\frac{d}{l_0}\log\left(\frac{l_0}{d}\right),\;{}^{*}$$

from which one would deduce an “apparent” mean free path for the
layer structure: $\dfrac{3}{4}\,\dfrac{1+p}{1-p}\,d\log\left(\dfrac{l_0}{d}\right)$, which in addition varies with $l_0$.
It may therefore be useful to bear this point in mind when considering
the significance of deviations from Matthiessen’s Rule in metals, or the
combination of resistance due to impurity scattering and physical defects.
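
The two limiting forms may be compared numerically with the full thin-film integral. The sketch below is illustrative only: it assumes the integral in the form usually quoted for the Fuchs problem with parameter $p$ (this form is not written out in the present paper), and the names `fuchs_ratio`, `thick_limit` and `thin_limit` are hypothetical. Here `kappa` stands for $d/l_0$.

```python
import math


def fuchs_ratio(kappa, p, n=20000):
    """Relative conductivity sigma/sigma_0 for layer spacing d = kappa * l_0.

    Assumed form (not quoted in the text):
        sigma/sigma_0 = 1 - (3 / (2 kappa)) (1 - p) I,
        I = integral from 1 to infinity of
            (t**-3 - t**-5) (1 - exp(-kappa t)) / (1 - p exp(-kappa t)) dt.
    The substitution u = 1/t maps this to 0 < u < 1 with weight (u - u**3),
    which is integrated here by the midpoint rule.
    """
    h = 1.0 / n
    total = 0.0
    for i in range(n):
        u = (i + 0.5) * h
        e = math.exp(-kappa / u)  # underflows harmlessly to 0.0 for small u
        total += (u - u**3) * (1.0 - e) / (1.0 - p * e)
    return 1.0 - 1.5 * (1.0 - p) * total * h / kappa


def thick_limit(kappa, p):
    """Limit for d >> l_0: 1 - 3 (1 - p) l_0 / (8 d)."""
    return 1.0 - 3.0 * (1.0 - p) / (8.0 * kappa)


def thin_limit(kappa, p):
    """Limit for d << l_0: (3/4) (1+p)/(1-p) (d/l_0) log(l_0/d); needs p < 1."""
    return 0.75 * (1.0 + p) / (1.0 - p) * kappa * math.log(1.0 / kappa)


if __name__ == "__main__":
    for kappa in (0.001, 0.01, 0.1, 1.0, 10.0, 100.0):
        print(kappa, fuchs_ratio(kappa, 0.0))
```

For large `kappa` the integral and the thick-layer form agree closely; for small `kappa` the logarithmic form is only a leading approximation, so agreement to within some ten per cent is all that should be expected.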


§ 3. APPLICATION OF A MAGNETIC FIELD. 


If a magnetic field be now applied perpendicularly to the layers the 
Hall field will be in the plane of the striations and therefore uniform. 
Consequently, we may take over immediately the expressions derived 
by Sondheimer (loc. cit.) for the variation of conductivity in a “ thin 
film ”’ under a perpendicular magnetic field with the appropriate boundary 
equations. Initially an increase of resistance is then predicted as the 
magnetic field is applied but subsequently the resistance falls again and, 
under suitable conditions, may go below the initial value although the 
reduction is never great. This phenomenon can occur because the 
electron-paths are now less often disturbed by the physical boundaries 
than in the absence of the field. 


* See Sondheimer’s (loc. cit.) correction to Fuchs’ quoted formula. 




If the magnetic field is applied instead in the plane of the striations 
(but still perpendicular to E) we must turn to the work of MacDonald 
and Sarginson (loc. cit.) on the influence of a magnetic field applied in 
the plane of a thin film. The Hall field is now generated perpendicular 
to the layers and will now be periodic as we traverse them. In the case 
of purely diffuse scattering the field will fall to zero at each boundary 
layer as in fig. 1 (b) and the behaviour of the metal will simply be that of 
a set of separate thin films of thickness d. In this case theory shows 
that the magnetic field can produce a large decrease of resistance tending 
ultimately towards the value characteristic of a metal without boundaries 
and this is confirmed experimentally (MacDonald loc. cit.). Partially 
diffuse scattering will evidently provide an intermediate character and 
the internal behaviour of the Hall field is sketched in fig. 1 (c), although 
detailed theory for thin films with partially diffuse scattering in this 
case has not yet been worked out. 


Fig. 1. (a) Sketch of Hall field in metal with no internal boundaries. (b) Sketch of Hall field in metal with internal diffusely-scattering boundaries. (c) Sketch of Hall field in metal with internal partly-scattering boundaries. (Magnetic field perpendicular to plane of paper; applied electric field along the paper.)


Finally a magnetic field applied parallel to the electric field (and
therefore in the plane of the striations) produces no Hall field; again
the resistance diminishes (due, crudely speaking, to the “focusing”
action of the magnetic field) without initial increase as has also been
confirmed experimentally (cf. MacDonald loc. cit. and Chambers loc. cit.).
It should perhaps be emphasized that these discussions assume an 
ideal isotropic ‘“‘ electron-gas”’. The deviations from ideality in a real 
metal will produce a component of intrinsic “ bulk ” magneto-resistance 
which is essentially positive and may be considered very approximately 
as superimposed on the “size-effect”. If of course the intrinsic magneto-resistance
is very large then the size-effects will be of small significance.




In all these cases, then, the application of a magnetic field—no matter 
what its direction—may produce an anomalous decrease in the resistance 
of a metal if the intrinsic magneto-resistance is not too large. 


§ 4. THE LOW-TEMPERATURE RESISTANCE OF GOLD AND THE
MAGNETO-RESISTANCE OF FERROMAGNETICS.

The electrical resistance of gold exhibits a definite minimum at low 
temperatures (e. g. de Haas, de Boer and van den Berg 1933, van den Berg 
1938) as also does magnesium (MacDonald and Mendelssohn 1950) 
although as yet much less thoroughly investigated (see also the recent 
work by Mendoza and Thomas (1951) on silver and copper as well). 
It is established that the location of the minimum in temperature is a 
function of the impurity present as measured by the residual resistance 
tending almost certainly to zero for an “ideally pure” specimen ; 
recently (MacDonald and Templeton 1951) it has been shown that for a 
given location in temperature of the minimum the associated residual 
resistance is lowest for silver as major impurity and progressively greater 
for copper and nickel. No theoretical explanation yet exists for this 
general behaviour. 

A remarkable feature is the anomalous magneto-resistive effect which
has been noted by Giauque, Stout and Clark (1937) and has also been
confirmed recently by the writer. Thus we find that for a gold specimen
having a minimum at ~8° K., the magneto-resistive increase at
~4.2° K. is +1.66 per cent for a field ~8 k gauss, while as the temperature
is lowered the magneto-resistance diminishes relatively rapidly, actually
becoming negative (~−0.19 per cent at ~8 k gauss) at ~1.5° K. A
related effect was first noted by Meissner and Scheffers (1929), and it
was then suggested that this might have its origin in very small ferro-magnetic
impurities. The later experiments appear to us to render this
hypothesis unlikely.

We are thus led to suggest that in these metals an internal size-effect
exists which under appropriate conditions can then account for the
observed results. We conceive in this case of some kind of internal
layer- or “block-” structure whose size is related essentially to the number
(and kind) of impurity atoms present. A crude estimate suggests that
the linear dimensions would be at least 10 times that of the dominant
mean free path due to direct impurity scattering. Thus the specimen
referred to above has a mean free path (at ~4° K.) of ~$10^{-4}$ cm., while
the diameter of an electron orbit in a field of 10 k gauss is about $10^{-3}$ cm.
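
The order of magnitude of the orbit diameter can be checked from the free-electron relation $r=mv/eH$. The sketch below is a rough check only: the Fermi speed taken for gold (about $1.4\times 10^{6}$ m/s) is an assumed free-electron figure that is not given in the text, and the function name is hypothetical.

```python
ELECTRON_MASS_KG = 9.109e-31
ELEMENTARY_CHARGE_C = 1.602e-19

# Assumed free-electron Fermi speed for gold (not stated in the text).
FERMI_SPEED_GOLD_M_PER_S = 1.4e6


def orbit_diameter_cm(speed_m_per_s, field_kgauss):
    """Diameter of a free-electron circular orbit, 2 m v / (e B), in cm."""
    field_tesla = field_kgauss * 0.1  # 1 k gauss = 0.1 tesla
    radius_m = ELECTRON_MASS_KG * speed_m_per_s / (ELEMENTARY_CHARGE_C * field_tesla)
    return 2.0 * radius_m * 100.0  # metres to centimetres


diameter_cm = orbit_diameter_cm(FERMI_SPEED_GOLD_M_PER_S, 10.0)
mean_free_path_cm = 1.0e-4  # figure quoted in the text for the gold specimen

print(f"orbit diameter ~ {diameter_cm:.1e} cm")
print(f"diameter / mean free path ~ {diameter_cm / mean_free_path_cm:.0f}")
```

On these assumptions the diameter comes out between one and two thousandths of a centimetre, that is, roughly ten or more times the quoted mean free path.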

In order to explain the existence of the minimum of resistance in 
temperature in zero magnetic field it would be necessary to assume 
that the relative probability of diffuse scattering at the layer boundaries 
is still increasing slightly at low temperatures where the (‘‘ residual ’’) 
component of resistance due to impurity scattering is dominant and 
constant. No obvious mechanism accounts for this postulate but we 
recall the variation of (external) boundary scattering in narrow capillaries 
of sodium reported by MacDonald and Sarginson (1950) which suggests 




that this form of scattering may not yet be fully understood *. If, 
however, the source of the minimum is to be found in this form of internal 
boundary scattering then since copper and nickel are progressively 
more effective impurity-centres for scattering in gold than is silver it 
appears quite natural that for a given location of minimum the residual 
resistances should in fact increase in the order observed. 

Finally, it is well known that the magneto-resistance of pure ferro-magnetic
metals in bulk is anomalous. Thus at room temperature
(e.g. Heaps 1911) the transverse magneto-resistance of iron first increases
with magnetic field and then decreases, becoming negative for fields above
~8 k gauss. Since ferro-magnetics are of course characterized by a
domain structure it seems quite reasonable to suggest that internal size-effect
may here also be directly responsible, in part at least, for this
behaviour. Since domain sizes are generally ~$10^{-8}$ to $10^{-9}$ cm.$^3$ (e.g.
Bates 1948) and therefore of linear dimensions ~$10^{-3}$ cm. this again
agrees with the magnitude of a free electron orbit in a field ~10 k gauss.

We should point out, however, that Mott (1936) has interpreted the 
characteristics of the transition metals in terms of a two band (s—d) 
model and treated in particular the magneto-resistance of nickel. He 
ascribes the negative magneto-resistance in that metal to the distribution 
of spin-directions in the unoccupied d-states to which s-electrons may be 
scattered. 


ACKNOWLEDGMENT. 


I am grateful to Professor N. F. Mott, F.R.S., and Dr. E. Mendoza 
for comments. 


REFERENCES. 


BATES, L. F., 1948, Modern Magnetism, p. 249, Cambridge (University Press).

BERG, G. J. VAN DEN, 1938, Thesis, Leiden.

CHAMBERS, R. G., 1950, Proc. Roy. Soc. A, 202, 378.

DINGLE, R. B., 1950, Proc. Roy. Soc. A, 201, 545.

FUCHS, K., 1938, Proc. Camb. Phil. Soc., 34, 100.

GIAUQUE, STOUT and CLARK, 1937, Phys. Rev., 51, 1108. (See also STOUT and
BARIEAU, 1939, Jour. Amer. Chem. Soc., 61, 238.)

DE HAAS, DE BOER and V. D. BERG, 1933, Physica, 1, 1115.

HEAPS, 1911, Phil. Mag., 22, 900.

MACDONALD, D. K. C., 1949, Nature, Lond., 163, 637.

MACDONALD, D. K. C., and MENDELSSOHN, K., 1950, Proc. Roy. Soc. A, 202, 523.

MACDONALD, D. K. C., and SARGINSON, K., 1950, Proc. Roy. Soc. A, 203, 223.

MACDONALD, D. K. C., and TEMPLETON, I. M., 1951, Phil. Mag., 42, 432.

MEISSNER and SCHEFFERS, 1929, Phys. Zs., 30, 827.

MENDOZA, E., and THOMAS, J. G., 1951, Phil. Mag., 42, 291.

MOTT, N. F., 1936, Proc. Roy. Soc. A, 153, 713.

THOMSON, J. J., 1901, Proc. Camb. Phil. Soc., 11, 120.


* The recent experiments of Mendoza and Thomas (loc. cit.) which show 
that the resistance-increase below say 1° K. can become rather rapid cast 
doubt on the likelihood of such a mechanism. 

† This discussion is of course rather crude since we have not considered the
effect of the “internal” field.


LXXVIII. The Radiations of ²⁰³Hg as Observed by a New Method.


By H. W. WILSON and S. C. CURRAN,
Department of Natural Philosophy, The University, Glasgow*.


[Received April 30, 1951.]


SUMMARY. 


The radioactivity of mercury has been investigated in a new way
which makes use of the integrating properties of a proportional counter
with an internal source. These allow us to show that the β spectrum
is of allowed shape from 20 KeV. to the end point which was found to
be 210 KeV. ± 5 KeV. The energy and internal conversion coefficients
of the γ ray which follows the β-emission have been evaluated. Also
the half life has been determined as 45.9 ± 0.5 days. A discussion of
the decay scheme and further possible use of the technique including its
extension to scintillation counters concludes the paper.


§ 1. INTRODUCTION.


The radiations emitted by the isotope ²⁰³Hg have been studied by a
number of investigators (Friedlander and Wu 1943, Miller and Curtiss
1945), and in some detail by Saxon (1948) and Slätis and Siegbahn (1949).
The essential features of the decay scheme are well established and fig. 1
exhibits these diagrammatically. A negative β-particle of limiting energy
close to 208 KeV. is emitted leaving the residual nucleus Tl excited to
an energy of approximately 280 KeV. The gamma ray transition is
strongly internally converted in both K (~20 per cent) and L, M, N . . .
shells (~6 per cent). No direct β-transitions between the ground states
of the nuclei have been observed. Both Saxon and Slätis and Siegbahn
have studied the form of parts of the spectrum by means of the magnetic
β-spectrograph. Saxon concluded it was of allowed form over the range
from ~40 to ~140 KeV. (the rest of the spectrum being masked by a
strong conversion line near the end-point) and with the help of a
β-γ coincidence technique Slätis and Siegbahn largely eliminated the
difficulties due to the presence of the conversion line. They were thus
able to show that the form of the spectrum was allowed in the region
of 200 KeV.

In the present work a new method of attack has been adopted. The
integrating features of the proportional counter are such that three
distinct possibilities arise. Thus, with a source situated within a
proportional counter we see that (1) the unconverted γ-rays largely
escape from the detector since the efficiency for such radiations is
<1 per cent and the β-rays of the primary spectrum are recorded;


* Communicated by the Authors. 




(2) the γ-rays converted in the K shell give rise to a photoelectron of
energy approximately 200 KeV. and the counter detects both the
photoelectron and time coincident β-particle each with 100 per cent
efficiency so giving rise to a second spectrum, displaced on the energy
scale by about 200 KeV.; (3) the γ-rays converted in the L, M, N . . .
shells give rise to photoelectrons of energy ~270-280 KeV., time
coincident with the primary β-rays and so producing a third spectrum
displaced by about 270-280 KeV.

There are refinements to the above picture. For example, the efficiency 
of the detector to the K-radiations of Tl was ~2 per cent and, hence, 
occasionally processes of group (2) above will be displaced into group (3). 


Fig. 1. Decay scheme of ²⁰³Hg. (²⁰³Hg to ²⁰³Tl: β⁻, 208 KeV.; γ, 280 KeV.)


The L, M, N . . . radiations normally emitted together with the
K-radiations are almost completely detected and so groups (1) and (2)
are usually expected to show a relative displacement of $h\nu_\gamma-h\nu_K$,
i.e. 208 KeV. Regarding group (3) the L, M, N . . . radiations are
likewise captured within the tube and, hence, the whole group is displaced
relative to group (1) by almost exactly $h\nu_\gamma$, i.e. 280 KeV. The three
groups are marked I., II. and III. in fig. 3. This phenomenon of the
successive appearance of the primary spectrum affords some very
considerable advantages. Thus, the beginning of the second spectrum,
group (2), shows the detailed shape of the spectrum at low energies
(see AB of fig. 3). The final end point of group (3) shows the detail
of the spectrum near the limiting energy, free from confusion due to the
superposition of the K conversion line (Inset curve, fig. 3). The whole
spectrum can thus be examined in detail and compared with theory.




Lastly, the relative intensities of the three groups can be estimated by 
comparing areas and so the K and L, M... conversion factors 
determined rather exactly. 
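
The arithmetic of the three displaced spectra can be set out in a few lines. The sketch below uses only the round figures quoted in the Introduction (β limit 208 KeV., γ energy 280 KeV., group (2) displacement 208 KeV.); the variable and function names are hypothetical, and the escaping K X-ray energy is simply the difference implied by those figures, not an independently measured value.

```python
BETA_END_POINT_KEV = 208.0   # limiting beta energy quoted in the text
GAMMA_ENERGY_KEV = 280.0     # gamma-ray energy quoted in the text
GROUP2_SHIFT_KEV = 208.0     # displacement of group (2) quoted in the text


def group_window(shift_kev):
    """Energy range (start, end) covered by a beta spectrum displaced by shift_kev."""
    return (shift_kev, shift_kev + BETA_END_POINT_KEV)


# K X-ray energy implied by the quoted displacement (an inference, not a measurement).
implied_k_xray_kev = GAMMA_ENERGY_KEV - GROUP2_SHIFT_KEV

windows = {
    "I": group_window(0.0),                 # unconverted gamma-ray escapes
    "II": group_window(GROUP2_SHIFT_KEV),   # K conversion, K X-ray escapes
    "III": group_window(GAMMA_ENERGY_KEV),  # L, M, N ... conversion
}

for name, (start, end) in windows.items():
    print(f"group {name}: {start:.0f} to {end:.0f} KeV")
```

On these figures the ultimate end-point of group (3) falls near 488 KeV., and groups (2) and (3) overlap between 280 and 416 KeV., which is why the separation of their areas relies on the known shape of the spectrum.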


§ 2. EXPERIMENTAL ARRANGEMENTS.


The experimental system for examination of the spectrum was as 
shown in fig. 2. A large counter with the field adjusting tubes described 
elsewhere (Cockroft and Curran 1951) was placed within a uniform 
magnetic field (intensity 1400 gauss) parallel to the wire (Curran 1950). 
The working length of the counter was 8in. and the diameter 5) in. 
The source was mounted on a separate probe terminating in a circular 
wire loop } in. in diameter. The plane of the loop was parallel to the end 


Fig. 2. Experimental arrangement of proportional counter in magnetic field. (Labels: magnetic field; case (copper); source; end correction tube; guard tube; 4 mil tungsten wire.)


faces of the counter. The loop was used to support a thin foil of aluminium
of mass 0.16 mg./cm.² and the radioactive material consisted of HgS
which had been sublimed on to the foil from a heated surface in vacuo.
The mass of the radioactive material was about 3 μgm./cm.² and almost
invisible. The total activity of the source was 12,000 counts per minute.
The source was confined to the central portion of the foil and the recorded
rays were emitted into 4π solid angle, half of the total number passing,
of course, through the thin supporting film. The probe was parallel to
the central wire of the tube and the voltage on the probe was adjusted to
the correct value appropriate to its position in the counter (1.1 in. from
the central wire). The central wire was 4 mil. in diameter and the argon
gas was at a pressure of 37 atmospheres (containing methane to a partial
pressure of 20 cm. of mercury). The pressure and the magnetic field
were such that no particles from the source could reach the cylindrical
wall or escape into the non-counting volume at the ends of the tube.




Calibration of the counter was effected with a beam of X-rays, 
consisting mainly of the characteristic K radiations of tungsten, directed 
at the wall. The pulses from the counter were amplified linearly and 
displayed on a cathode ray oscillograph. For the spectrum analysis 
two methods were used: (1) the pulses were recorded photographically 
on moving film and analysed by reprojecting on a measuring screen ; 
(2) the pulses from the amplifier were passed into a single channel 
electronic kick sorter. The results obtained by both methods were in 
good agreement and the curves given below are representative of those 
obtained by both techniques. 


Fig. 3. Complete spectrum of pulses observed in pressurized proportional counter. Detail near end-point shown in inset figure. (Ordinate: number of pulses per energy interval.)


The form of the spectra obtained is as shown in fig. 3. It is obvious 
that the primary spectrum is repeated as expected. The spectrum 
includes the spectrum of background particles (~700 counts/min. total). 
This, however, was deducted in making the calculations for the Fermi 
plot and the internal conversion coefficients. No serious effort was made 
to carry the analysis of the group (1) particles to very low energies since 




the beginning of group (2) gives the shape in the very low energy region. 
Nevertheless the spectrum was looked at with higher gain to give the 
shape of the low energy region more exactly and the number of counts 
per energy interval below 130 KeV. in fig. 3 are taken from these results. 
The ultimate end-point of the distribution is shown in more detail in the 
inset figure. 

Fermi plots, fig. 4, were made as follows: (I.) for the middle range, from
group (1); (II.) for the end-point region from group (3); (III.) for the
low energy region from the start of group (2), i.e. the portion AB of the
spectrum. The intensities read from the spectrum were corrected for the


Fig. 4. Composite Fermi Plot for ²⁰³Hg. (Key: points from group I, fig. 3; from portion AB, fig. 3; from end point inset, fig. 3.)


background spectrum in making the calculations. The @ term in the
ordinate function is taken from a paper by Bleuler and Zünti (1946).
The fitting of these three regions together results in a good straight line
from ~20 KeV. to the end-point which we estimate to be 210 ± 5 KeV.
The final end-point of the histogram gives a more precise value of the
limiting energy of the β-rays provided the energy of the γ-radiation is
known. This was separately determined (278 KeV.) as discussed below
and the β-ray end-point again found to be 210 ± 5 KeV. This agrees
with the value given by Slätis and Siegbahn. The Fermi plot is shown




as linear to zero energy, the point at 0 KeV. being calculated from the 
corresponding point on the curve AB, fig. 3. In view of the aluminium 
leaf source support, this linearity to such a low energy must be regarded 
as accidental. 


§ 3. ADDITIONAL DATA.
Half Life.

There appeared to be some considerable variation in the value of the
half life of ²⁰³Hg as determined by different authors (Friedlander and
Wu 1943, Saxon 1948) and a careful measurement was made. Two similar
Geiger counters were enclosed in a single envelope constructed in such a
fashion that each could be individually irradiated. One was exposed to a

Fig. 5. Controlled Geiger tube apparatus used for half-life determination. (Labels: mercury source; lead shield; carbon source; scale 5 cms.)


source of ²⁰³Hg (as mercuric oxide) and the other to a source of ¹⁴C (as
barium carbonate). The latter was intended as a constant control.
Actually, slight temperature-dependent variations appeared in the
effectively constant counting rate observed for ¹⁴C. These did not show
in the curve of decay observed for ²⁰³Hg. These variations were allowed
for by finding experimentally the law relating the ¹⁴C counting rate
to temperature and reducing all ¹⁴C readings to a standard temperature
of 17° C. The apparatus is sketched diagrammatically in fig. 5. It was
designed so that the chance of a particle from one source being scattered
into the other counter was extremely small. The two counters shared
a common gas filling. The lead shield was used to keep mercury γ-rays
from affecting the standard ¹⁴C counter.




The half-life, after correction for counting losses due to dead time, etc.,
was found to be 45.9 ± 0.5 day. This lies within the range of previously
carefully measured values and assured us that the isotope was effectively
pure. It was prepared for us at A.E.R.E. by bombarding liquid mercury
with slow neutrons. The half-life graphs are shown in fig. 6. Graph (a)
is the logarithm of the mercury counting rate (after correction for dead


Fig. 6. Graph of half-life results. (Abscissa: days.)


time losses) plotted with respect to time as abscissa, while (b) is the
logarithm of the ratio of the mercury counting rate to the standard carbon
counting rate. (a) gave a value of 45.84 day and (b) a value of 45.95 day
for the half life.
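
The reduction of a logarithmic decay graph to a half-life amounts to fitting a straight line and taking $T_{1/2}=-\ln 2/\text{slope}$. The sketch below shows one way this might be done by ordinary least squares; it is not the authors' procedure, the function name is hypothetical, and the counting rates are synthetic, noise-free values generated from an assumed half-life of 45.9 day purely to exercise the fit.

```python
import math


def half_life_from_rates(days, rates):
    """Least-squares slope of ln(rate) against time, converted to a half-life."""
    n = len(days)
    logs = [math.log(r) for r in rates]
    mean_t = sum(days) / n
    mean_y = sum(logs) / n
    sxx = sum((t - mean_t) ** 2 for t in days)
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(days, logs))
    slope = sxy / sxx            # equals minus the decay constant, per day
    return -math.log(2.0) / slope


# Synthetic illustration only: readings every 10 days over 70 days.
days = [float(t) for t in range(0, 71, 10)]
rates = [12000.0 * 0.5 ** (t / 45.9) for t in days]

estimate_days = half_life_from_rates(days, rates)
print(f"fitted half-life = {estimate_days:.2f} day")
```

Applying the same function to the ratio of the mercury rate to the control rate would correspond to graph (b), since a constant control leaves the slope unchanged.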


Energy of Gamma Radiation. 

A semi-circular magnetic spectrograph (resolution 2 per cent) was
employed. A thin platinum radiator (4 mgm./cm.²) was wrapped round
the source. A histogram (fig. 7) gave strong K and L peaks and the
value of the γ-ray energy was found to be 278 ± 3 KeV.




Absorption Curves for β-rays and Electrons.

Fig. 8 shows the result of some measurements on the β-rays and
conversion electrons. The absorber mass scale has been corrected for
counter window thickness and air gap (total ~6 mgm.). A good
end-point is apparent and, according to a graph of Glendenin (1948),
the range in the absorber corresponds to an energy of 280 KeV. This

Fig. 7. Photo-electric spectrum of the γ-ray of ²⁰³Hg, with 4 mgm./cm.² Pt radiator. (Abscissa: fluxmeter deflection.)


confirms the γ-ray measurement, as there must be photo-electrons
released with practically the whole quantum energy. A slight break is
apparent on the absorption curve at about 210 keV., the limiting energy
of the β-rays. This portion of the graph is not included in the figure.




We were anxious to observe possible evidence of the emission of β-rays
in the ground state to ground state transition, but no such evidence
appears. From examination of the absorption curve and other data, we
believe that the transition to the excited state is more probable than
the ground to ground state transition by a factor >100.


§ 4. DISCUSSION.


The study of the disintegration of ²⁰³Hg was considered interesting
primarily because of the possibility of determining the shape of the
β-spectrum at very low energies with the help of the new integrating
feature of the proportional counter. Relatively soft β-emitters of
high Z value such as ²⁰³Hg have not been examined in detail at low
energies in the past and possibly they could exhibit rather unusual features


Fig. 8. Absorption of beta rays in aluminium. (Ordinate: counts per minute; abscissa: mgm./cm.² of aluminium.)


in this region. Thus, we might expect relatively large effects from the 
extra-nuclear electrons or again radiative transitions of the type recently 
discussed by Ivanenko and Lebedev (1950) might be prominent. It is, 
therefore, particularly notable that the allowed form of the spectrum 
is rather closely preserved down to ~20 keV. as shown above. Still 
more refined examination of the spectrum at low energy values will be 
possible when the source is mounted on still thinner supporting films and 
when the resolution of the instrument is improved. It is thought that 
traces of electronegative gases impaired the resolving power and a 
continuously active purifier is being fitted to the counter. 

The ft value of ²⁰³Hg, assuming t=46 day and f=0.6 (from the curves
of Feenberg and Trigg (1950)) is 2.4×10⁶ giving log ft=6.38. It is
not completely obvious from this value whether ²⁰³Hg is allowed or



forbidden although Feenberg and Trigg remark with regard to fig. 10 of 
their paper, that the peak at log ft=62 is “ predominately first 
forbidden”. If we fit the data to the Sargent diagrams (for 80<Z <84) 
due to Feather and Richardson (1948) we find that a first forbidden 
transition with 4I——1 and a change of parity is suggested. In either 
case the allowed form shown by the spectrum is to be expected 
theoretically. 
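As a check on the arithmetic, the ft value quoted in this paragraph can be retraced in a few lines. This is only a sketch: the function name `ft_value` is our own, and the half-life is simply converted to seconds and multiplied by the f value read from the Feenberg and Trigg curves.

```python
import math

SECONDS_PER_DAY = 86_400


def ft_value(f: float, half_life_days: float) -> float:
    """Comparative half-life: f multiplied by the half-life in seconds."""
    return f * half_life_days * SECONDS_PER_DAY


# Values quoted in the text: f = 0.6, half-life = 46 days.
ft = ft_value(f=0.6, half_life_days=46.0)
log_ft = math.log10(ft)

print(f"ft = {ft:.2e} sec, log ft = {log_ft:.2f}")
```

The result agrees with the rounded figures given in the text to the precision stated there.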


Internal Conversion Coefficients. 

By comparing areas under the three spectra (knowing the Fermi plot
is straight allows accurate separation of pulses in group (2) from those in
group (3)), we estimate the conversion factors (defined as the number of
conversion electrons divided by the total number of γ-ray disintegrations)
to be

α_K = 15·6 per cent; α_{L, M...} = 4·2 per cent; α = 19·8 per cent

and the ratio α_K/α_{L, M...} = 3·71.

We note that Saxon gave the α value as 0·24. If we define α in the
more usual manner as the ratio of the number of conversion electrons to
the number of γ-rays emitted, our value of α rises to 0·247, in good
agreement with Saxon. The work of Slätis and Siegbahn gives the
ratio 3·0 for α_K/α_L.
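The change of definition can be made explicit with a short calculation. The sketch below assumes only that every disintegration through the excited state yields either a conversion electron or an emitted γ-ray, so that a fraction a per disintegration becomes a/(1−a) per emitted γ-ray; the function and variable names are our own.

```python
def per_emitted_gamma(fraction_per_disintegration: float) -> float:
    """Convert 'electrons per gamma-ray disintegration' into
    'electrons per emitted gamma-ray', assuming each disintegration
    gives either one conversion electron or one gamma-ray."""
    return fraction_per_disintegration / (1.0 - fraction_per_disintegration)


alpha_K = 0.156    # K-shell conversion factor quoted in the text
alpha_LM = 0.042   # L, M... conversion factor quoted in the text
alpha_total = alpha_K + alpha_LM

alpha_usual = per_emitted_gamma(alpha_total)
shell_ratio = alpha_K / alpha_LM

print(f"total per disintegration = {alpha_total:.3f}")
print(f"total per emitted gamma-ray = {alpha_usual:.3f}")
print(f"K to (L, M...) ratio = {shell_ratio:.2f}")
```

With the quoted factors this reproduces both the total of 19·8 per cent and the value 0·247 obtained under the more usual definition.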

The value of α_K predicted theoretically by Taylor and Mott (1932)
for (mc²/hν)=1·83 is close to 14 per cent for electric quadrupole radiation
at Z=83. Here we have Z=81. The results suggest strongly that the
γ-ray transition in the Tl nucleus is electric quadrupole in type.


Shell Model. 

The shell model predictions do not seem to be sufficiently precise in
the region of atomic number >80 to assist materially in the assignment
of spin values. It is reasonable, however, according to present views, to
suppose that ²⁰³Hg has a spin of 1/2 or 3/2 and it is much less likely to have a
still greater spin value. Moreover, and rather more significantly, the
spin values of almost all nuclei in the neighbourhood of mercury and
thallium are in the range 0≤I≤3/2. The ground state spin value of
²⁰³Tl has been measured and has the value 1/2. The spin value of ²⁰¹Hg
in the ground state is 3/2. It is very reasonable to suppose that the
addition of two neutrons to form ²⁰³Hg should leave the spin unchanged
at the value 3/2. If we assign this value to ²⁰³Hg and 3/2 to the excited
state of ²⁰³Tl we can
tentatively construct the decay scheme shown in fig. 9 (a). Here the
γ-ray transition is obviously in line with the experimental facts among
which has to be noted the conclusion that the life time of the state at
0·28 MeV. is <0·3×10⁻⁸ sec. (Deutsch and Wright 1950), or <2×10⁻⁸ sec.
by an independent experiment of Binder (1950). The β-ray transitions
are both allowed according to Gamow-Teller rules but experimentally
the (−1, no) transition is apparently forbidden relative to the (0, no).
This is clearly the weak link in the scheme and it may be that here the
application of the shell model is misleading. It seems to us that the
decay might be more correctly represented by the scheme shown in
fig. 9 (b). The spin value 5/2 is assigned to ²⁰³Hg and it is now possible
to assign a spin change of −1 and a parity change to the observed
β-transition, bringing it into agreement with the conclusions of Feather
and Richardson. The ground state to ground state β-transition is
(−2, yes). There is evidence that transitions of the type (ΔI=2, yes)
have peculiarly large (E₀²−1) ft values, of the order of 10¹⁰ (Shull and
Feenberg 1949, Wu 1950). Now, according to our observations, the
β-transition of energy 0·49 MeV. is less probable than the observed
transition of energy 0·21 MeV. by a factor ≥100. The energy releases
favour the higher energy transition by a factor of about 17, according to
calculations of Feenberg and Trigg in the paper referred to previously.
Hence, the ft value for (ΔI=−2, yes) has to exceed the ft value for
(ΔI=−1, yes) by a factor ≥1700. This is clearly possible if the
respective values are ~10¹⁰ and 2·4×10⁶. On the whole,
therefore, the scheme of fig. 9 (b) is perhaps the most satisfactory from
the point of view of both experiment and theory. The main point of
difficulty which it introduces is its inconsistency with the shell model.
The method of classification of β-emitters due to Dr. K. M.
Guggenheimer of this department suggests that the 0·21 MeV. transition
is in the allowed class. His work indicates that the 0·49 MeV.
transition will be less probable than the low energy transition by a
factor of ~224 if the first is second forbidden and the second allowed.

[Fig. 9. Possible spin and parity changes in decay of ²⁰³Hg; schemes (a) and (b). Legible labels: β 0·21 (0, no); β 0·49 (−1, no); hν 0·28 (−1, no), E.Q.]
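The factor of 1700 between the two ft values, argued in the discussion of fig. 9 (b), can be retraced as follows. The sketch assumes that the partial half-life of each branch is inversely proportional to its intensity, so that the ratio of ft values is the product of the intensity ratio and the ratio of f values; the variable names are illustrative.

```python
# Observed: the 0.21 MeV branch is at least 100 times more intense
# than the 0.49 MeV (ground state to ground state) branch.
intensity_ratio_min = 100.0

# Energy release favours the 0.49 MeV branch by a factor of about 17
# (from the Feenberg and Trigg curves, as quoted in the text).
f_ratio = 17.0

# ft value of the observed 0.21 MeV transition, as quoted in the text.
ft_observed = 2.4e6

# The partial half-life of the weak branch is longer by the intensity
# ratio and its f value is larger by f_ratio, so the ft values differ
# by the product of the two factors.
ft_ratio_min = intensity_ratio_min * f_ratio
ft_ground_min = ft_ratio_min * ft_observed

print(f"ft ratio >= {ft_ratio_min:.0f}")
print(f"ft (ground to ground) >= {ft_ground_min:.2e}")
```

The lower bound obtained in this way lies below 10¹⁰, which is why an ft value of that order for the ground state transition is compatible with the observations.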

A scheme satisfying these conclusions is that of fig. 9 (b) with the
β-transitions changed from (ΔI=−2, yes) and (ΔI=−1, yes) to
(ΔI=−2, no) and (ΔI=−1, no). This introduces, of course, disagreement
with the classification due to Feather and Richardson. Much of the
uncertainty as to the decay scheme would be removed by the experimental
measurement of the spin of ²⁰³Hg.


§ 5. GENERAL REMARKS. 


The method promises to be one of considerable value in those cases of decay
in which conversion electrons are of greater energy than the primary
β-rays. Under such conditions the separate spectra observed give details
of the low energy region of the primary spectrum and conversion
coefficients can be readily evaluated. Work is in progress with RaD, which
probably satisfies the required condition.

Similar considerations apply to decay schemes in which the γ-radiation
following the β-emission is not appreciably converted but can be detected
with high efficiency. Obviously, the most suitable detector in this case
is the scintillation counter. Thus, if a very thin radioactive source is
placed between two crystals of, say, sodium iodide (thallium-activated)
the arrangement constitutes a solid proportional counter identical in
principle with the gas filled proportional counter arrangement discussed
above. The near 100 per cent efficiency for soft γ-rays of the scintillation
detector permits its successful application to cases in which appreciable
internal conversion does not take place. We are thus enabled to make use
of the integrating effect of a proportional counter (with 4π solid angle) in
this case also. As examples of the possibilities of the scintillation
technique, we note that it is easy to detect almost 100 per cent of the K X-rays
of Tl and not very difficult to approach the same complete detection of
γ-rays of energy 0·28 MeV. In such circumstances, the group (1) and
group (2) pulses, as shown in fig. 3, are displaced towards the right and
both coincide with group (3) giving a single spectrum with starting point
at 278 keV. and end-point at 488 keV. approximately. Preliminary
work on mercury with a scintillation counter arrangement, as described
above, has resulted in the removal of group (2) leaving a distribution
consisting only of group (1) and a strong group (3). Further work with
larger crystals will allow us to concentrate all the radiations into a single
spectrum. This work will be published separately. Obviously in this
case the examination of the β-spectrum is considerably facilitated. Indeed
deliberate variation of the sensitivity of the arrangement to the different
types of radiation can be used to assist in the analysis of decay schemes.
Other sources of this kind can be instanced, for example ¹³⁷Cs. Another
example occurs in the case of RaD where we can detect with nearly 100 per
cent efficiency the hardest γ-rays, of energy 46·7 keV., and so obtain a
true picture of the primary β-spectrum displaced along the energy axis by
the same amount. This displacement would enable us to observe the
complete soft spectrum in some detail. Clearly, both the gas filled and
solid type (scintillation) proportional counters offer very interesting
possibilities if their integrating action is turned deliberately to good
account.
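The displacement of the spectrum described in this section amounts to adding the fully absorbed γ-ray energy to every β pulse. A minimal sketch, assuming complete absorption of the γ-ray and taking the β end-point as 210 keV (the value implied by the limits of 278 and 488 keV given in the text); the function name is hypothetical:

```python
def displaced_spectrum_limits(beta_end_point_kev: float, gamma_kev: float):
    """Return (start, end) of the summed spectrum when the gamma-ray
    accompanying each beta-particle is completely absorbed."""
    start = gamma_kev                      # beta-particle of zero energy
    end = gamma_kev + beta_end_point_kev   # beta-particle at the end-point
    return start, end


start, end = displaced_spectrum_limits(beta_end_point_kev=210.0, gamma_kev=278.0)
print(f"summed spectrum runs from {start:.0f} keV to {end:.0f} keV")
```

The same function applied with a γ-ray energy of 46·7 keV gives the starting point of the displaced RaD spectrum.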


We wish to thank Dr. K. M. Guggenheimer and Mr. D. L. Pursey of this 
department for helpful discussions on the theoretical side of this work. 


REFERENCES. 


Friedlander, G., and Wu, C. S., 1943, Phys. Rev., 63, 227.
Miller, L. C., and Curtiss, L. F., 1945, Pu. Proj. Rep. CP-3102 (August).
Saxon, D., 1948, Phys. Rev., 74, 849.
Slätis, H., and Siegbahn, K., 1949, Phys. Rev., 75, 318.
Cockroft, A. L., and Curran, S. C., 1951, Rev. Sci. Inst., 22, 37.
Curran, S. C., 1950, Atomics, 1, 221.
Bleuler, E., and Zünti, W., 1946, Helv. Phys. Acta, 19, 375.
Glendenin, L. E., 1948, Nucleonics, 2, 12.
Ivanenko, D., and Lebedev, V., 1950, Jour. Exp. Theor. Phys. U.S.S.R., 20, 91-2 (Jan.).
Feenberg, E., and Trigg, G., 1950, Rev. Mod. Phys., 22, 399.
Feather, N., and Richardson, H. O. W., 1948, Proc. Phys. Soc., 61, 452.
Taylor, H. M., and Mott, N. F., 1932, Proc. Roy. Soc., 138, 665.
Deutsch, M., and Wright, W. E., 1950, Phys. Rev., 77, 139.
Binder, D., 1950, Phys. Rev., 78, 325.
Shull, F. B., and Feenberg, E., 1949, Phys. Rev., 75, 1768.
Wu, C. S., 1950, Rev. Mod. Phys., 22, 386.




LXXIX. Microscopic Studies on Beryl Crystals.—I. 
Observation of Uni-Molecular Steps. 


By L. J. GRIFFIN,
H. H. Wills Physical Laboratory, Bristol*. 


[Received May 1, 1951.] 
[Plates XXII. & XXIII] 


SUMMARY. 


Microscopic methods for observing uni-molecular steps on beryl 
crystals are described and discussed. Possible causes of the visibility 
of the steps are considered. 


§ 1. GENERAL. 


In the course of topographical studies of prism {101̄0} faces of beryl,
it was observed that steps which interferometric measurements proved 
to be only a few hundred angstroms in height could be detected by the 
naked eye. This led to the development of a simple microscopic technique 
for the observation of such steps and it was then found that prism faces 
of beryl crystals often showed an extensive line structure which behaved 
in every way as if it corresponded to a step structure. 

Individual lines of this structure could not be detected by multiple 
beam interferometric methods, but within the limits of error of this 
method the lines were observed to follow contour lines of the surface, 
being widely spaced when the surface was relatively flat and crowding 
together on gradients. In particular there were characteristic surface 
features, the contours of which followed roughly circular arcs ; the line 
structure on such features also followed circular arcs. 

Faces of beryl were examined in their natural untouched state and 
after the deposition of a highly reflecting thin film of silver on the surface. 
In all cases the appearance and behaviour of the line structure was the
same, the sole difference being that the visibility of the lines increased
considerably when the surface was silvered. These lines must therefore 
be genuine geometric surface details and can only represent surface steps. 


* Communicated by Professor N. F. Mott, F.R.S.


§ 2. HEIGHT OF THE OBSERVED STEPS.

It was obviously desirable to determine the height of these steps, but
even when very sensitive multiple beam interferometric methods were
used this did not prove to be easy. However it was easily possible to
fix an upper limit to their height, for when a fringe crossed a line no
kink in the fringe was visible. This meant that the steps could not be
higher than 20 A.

In principle any area of a face containing two sections, one of which 
showed many steps, and the other few steps could be used to determine 
the average height of the steps. This is possible by determining the 
angle between the two sections by means of fringes of equal chromatic 
order and then counting the number of steps in equal lengths of the two 
sections from a photomicrograph. In practice there were two serious 
difficulties. Firstly it was normally necessary to work at high magni- 
fications on areas of this kind, and this could result in considerable 
experimental difficulties in the interferometric observations and 
measurements. Secondly it is essential for the accuracy of the inter- 
ferometric measurements that the position of the image of the spectrograph 
slit on the crystal surface should be known accurately. In this case it 
was not possible to make reference marks on the surface of the crystal. 

One area was discovered where these difficulties could be avoided.
In this case a large number of steps suddenly turned through approximately
60°, thus producing the two different gradients, while the area
lay between two obvious topographical features which could act as
reference marks. The actual angle measurement was made by a method
using both multiple beam Fizeau fringes and fringes of equal chromatic
order (Barakat 1949) and the dihedral angle between the two sections
was found to be tan⁻¹ (0·57×10⁻³). The position of the image of the slit
of the spectrograph having been determined, a high magnification
photomicrograph showed that in one section the slit had crossed some
110 steps in 1·04×10⁻² cm. (in approximately the direction of greatest
slope). In the other section 40 steps were crossed by the slit in an equal
distance. Therefore for 70 steps the average step height is 8·5 A.±1 A.
The main error in this measurement was due to the uncertainty in the
number of steps crossed by the slit. In some cases steps which were
apparently single could be resolved into two steps after turning through
60°, and in several instances it was not possible to decide whether steps
were truly single, or unresolved double, steps.
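The arithmetic behind this step height can be sketched as follows. The names are illustrative only; the slope, traverse length and step counts are the values quoted in the text, and the final loop merely shows how sensitive the result is to a miscount of the steps (the counts 62 and 78 are hypothetical, chosen for illustration).

```python
ANGSTROM_PER_CM = 1.0e8


def step_height_angstrom(slope: float, length_cm: float, extra_steps: int) -> float:
    """Average step height when the height difference between the two
    sections, slope * length, is shared among the extra steps."""
    height_difference_cm = slope * length_cm
    return height_difference_cm / extra_steps * ANGSTROM_PER_CM


slope = 0.57e-3          # tangent of the dihedral angle between the sections
length_cm = 1.04e-2      # length of slit image examined in each section
extra_steps = 110 - 40   # difference in the number of steps crossed

height = step_height_angstrom(slope, length_cm, extra_steps)
print(f"average step height = {height:.1f} A.")

# Effect of a hypothetical miscount of the extra steps.
for hypothetical_count in (62, 78):
    value = step_height_angstrom(slope, length_cm, hypothetical_count)
    print(f"{hypothetical_count} steps give {value:.1f} A.")
```

A miscount of some eight steps in seventy changes the result by about 1 A., which is consistent with the statement that the counting uncertainty dominates the error.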

The building unit of beryl appears to be the Si₆O₁₈ ring with the
attached two Al and three Be ions which make half the unit cell as
determined by Bragg and West (1926). These building units are packed
side by side in the basal plane of the crystal and are stacked one on top
of the other in the direction of the hexad (c-) axis, the orientation of
alternate Si₆O₁₈ rings along this axis differing by 25°. The height of
these units is 4·6 A. and their diameter 7·9 A. The latter is the dimension
which will appear as a step height on the {101̄0} faces examined.
Comparing this value of 7·9 A. with the experimentally determined value
of the step height it is obvious that the steps observed are uni-molecular.

Although this determination was made only on one small area there 
appears to be no reason to doubt that the result is generally true. The
upper limit which is generally applicable fixes the maximum height at
2 molecules and a considerable amount of crystallographic evidence 
strongly supports the conclusion that the steps on the prism faces observed 
are almost always uni-molecular (Griffin 1950, 1951). There are 
exceptions to this rule, but they can normally be recognized and some 
cases are discussed below. 
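The two statements used here, that the general upper limit allows at most two molecules and that the measured height singles out one, can be checked with a short calculation. This is a sketch with hypothetical helper names, using only the dimensions quoted in the text.

```python
import math

PRISM_UNIT_A = 7.9    # diameter of the building unit (step on a prism face)
BASAL_UNIT_A = 4.6    # height of the building unit
UPPER_LIMIT_A = 20.0  # no kink visible in a fringe crossing a step
MEASURED_A = 8.5      # measured average step height
ERROR_A = 1.0         # quoted uncertainty


def max_whole_units(limit_a: float, unit_a: float) -> int:
    """Largest whole number of building units not exceeding the limit."""
    return math.floor(limit_a / unit_a)


def within_error(candidate_a: float) -> bool:
    """True if a candidate step height agrees with the measurement."""
    return abs(candidate_a - MEASURED_A) <= ERROR_A


print("maximum units under the upper limit:", max_whole_units(UPPER_LIMIT_A, PRISM_UNIT_A))
for label, candidate in (("one unit", PRISM_UNIT_A),
                         ("two units", 2 * PRISM_UNIT_A),
                         ("basal unit", BASAL_UNIT_A)):
    print(label, candidate, within_error(candidate))
```

Only the single 7·9 A. unit falls within the quoted error of the measured height.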


§ 3. TECHNIQUES OF OBSERVATION. 


All three of the simple microscopic methods used for step observation 
were used in reflection because 

(i.) any difficulties due to imperfections and flaws in the body of the 
crystal were thereby avoided ; 

(ii.) the optical height of the step was thereby increased. For a
specimen in air the optical step height in transmission is (μ−1)t and in
reflection 2t and in the case of beryl this is equivalent to an increase in
optical height of 3·5 times.
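The factor of 3·5 follows directly from the two expressions for the optical step height. In the sketch below the refractive index of beryl is not taken from the text; the value 1·57 is assumed as typical, and the function names are our own.

```python
def optical_height_transmission(t: float, mu: float) -> float:
    """Optical step height in transmission for a specimen in air."""
    return (mu - 1.0) * t


def optical_height_reflection(t: float) -> float:
    """Optical step height in reflection for a specimen in air."""
    return 2.0 * t


MU_BERYL = 1.57   # assumed typical refractive index of beryl
t = 7.9           # step height in angstroms (cancels in the ratio)

gain = optical_height_reflection(t) / optical_height_transmission(t, MU_BERYL)
print(f"gain in optical height on working in reflection = {gain:.1f}")
```

The ratio is 2/(μ−1) and is independent of the step height itself.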

For the best results the crystal face to be examined is first coated with 
a thin film of silver. The reflectivity of this film is not critical and 
values of 80 per cent—90 per cent are convenient. The silver is deposited 
under conditions normal for multiple beam interferometry, i.e. the 
surface is carefully cleaned and the silver deposited by thermal evaporation 
in a vacuum of pressure about 10⁻⁵ mm. Hg. The presence of the silver
layer results in 

(i.) a considerable increase in the visibility of the diffraction pattern 
marking the edges ; 

(ii.) a large reduction of the relative intensity of parasitic illumination 
produced at interfaces in the optical train. 


1. Bright field narrow pencil illumination. 

With this method a normal type of metallurgical microscope was used 
with the field iris stopped down to give a fairly narrow illuminating beam, 
the steps being visible immediately the crystal surface came into focus. 
The optimum aperture of the field iris, which varies a little according to 
the step height, can be very easily determined by closing down the iris 
as far as possible and then slowly opening it again. At some point 
a sudden decrease in visibility of the steps will be noticed and it is 
advisable to keep the aperture stopped down well below this value. 

This method was used throughout the whole of the early part of the 
work and the effect of the focal setting of the microscope objective was 
investigated. It was found that 

(i.) this diffraction phenomenon is an “in-focus” effect. When the
surface of the crystal was truly in focus the diffraction pattern took its
simplest form, had the least lateral extension and maximum peak
intensities, the criteria usually applied to judge whether an image is
in or out of focus ;

(ii.) that the intensity distributions are not complementary about the 
in-focus position. By analogy with the usual method for observing small 
path differences in a transparent body with a transmission microscope 
the contrary would be expected. An asymmetry of this kind might result 
from tilt of the specimen or deviations from axiality in components of 
the microscope. However the variation of focus about the centre, as 
observed on a photographic plate, appeared to be radially symmetric 
and with the objective used (focal length, 4 mm., N.A. 0·95) this should
constitute a fairly sensitive test for any kind of deviation from axiality ; 

(iii.) that because of the non-complementary intensity distribution and 
its lateral asymmetry, it is possible in principle to determine the direction 
of a step by simple inspection of the diffraction pattern. 

In practice, as discussed below, the visibility of the pattern for different 
steps can vary considerably, and with low visibility patterns it is often 
difficult to determine the step direction. 

(iv.) that the relation between the phase and amplitude of the direct 
and diffracted waves is complex and varies with the focal setting. In 
the in-focus position a step is imaged by a narrow minimum with a broader 
maximum on the low side of the step; when the focal surface of the 
objective was a few microns from the plane of the face, up to nine maxima 
and minima could be observed. Major changes in the intensity distri- 
bution of the pattern took place for changes in focal setting of less than half 
a micron.

The type of in-focus imaging mentioned above required that a narrow 
and not very deep minimum be observed, as the maximum had a very 
poor visibility against the general bright field. This was necessarily 
fatiguing and it was found more convenient to work slightly off focus, 
as this resulted in the minimum becoming broader and more obvious, 
and therefore easier to see against the bright field. 

2. Oblique dark field illumination. 

Uni-molecular steps were also found to be visible using a normal type 
of reflection dark-field illumination. Although the visibility of the steps 
theoretically has a maximum value in dark field, it was in fact found to 
be much more convenient and satisfactory to use either of the other 
methods of observation. This was because foreign material was invariably
found to be deposited on the crystal faces and was visible as a random
surface marking. Consequently not only did the step edges scatter light
into the objective but the deposits of foreign material, whether point or
large-scale, also did so, and as the intensity of light scattered by the steps
was very low this randomly scattered light tended to mask the step
images.

3. Phase contrast. 


This method has been found to be the most sensitive and generally
most satisfactory. A commercial type of reflection phase-contrast
equipment with a very slightly absorbing phase ring has been used.
This results in the steps being marked by a very deep and narrow
minimum and a lesser maximum. This minimum is much deeper than in
the narrow pencil method (i.e. the visibility is much higher) and the
width of the whole pattern is much less. Because of the increased contrast
of the “image” it is possible to work with the crystal surface
in sharp focus, thus resulting in much crisper definition and higher
resolving power. The phase contrast equipment is, in this case, not
working in the way for which it was designed for there is no question of
any general difference in intensity on either side of the uni-molecular
steps. It is, however, understandable that the visibility of the diffraction
effect should be greater in phase contrast conditions than with the narrow
illuminating pencil method.


§ 4. EFFECT OF NUMERICAL APERTURE OF OBJECTIVES.


The visibility of the diffraction patterns varies with the value of the 
N.A. of the objective. Comparisons of visibility were made visually and 
are therefore necessarily very approximate. 

One of the main complicating factors was that a change of numerical
aperture normally necessitated a change in focal length of the objective.
This is almost invariably the case in microscopy and in the discussion
below a correspondence will be assumed between increasing N.A. and 
decreasing focal length. Thus a relationship existed between the lateral 
extent of the diffraction pattern and the value of the N.A. This probably 
resulted in a decrease of visibility with increasing N.A., but against this 
one must balance the tendency of the eye to prefer an easily resolved 
pattern. These limitations must result in an estimate of the optimum 
observing N.A. which is very tentative. 

Steps of 7·9 A. were observed with objectives having an upper limit
of N.A. of 1·3 and a lower limit of 0·08. The maximum visibility was
obtained with N.A.’s lying between 0·45 and 0·15.

Steps were also observed on terminal faces of beryl (0001), the
diffraction patterns in this case always being of much lower visibility
than in the case of 7·9 A. steps. It was not possible to make any measurement
of step height in this case but from crystallographic evidence it
appears possible that these steps may only be 4·6 A. high. Two additional
factors may be considered in this case :

(i.) As the visibility of the pattern has decreased considerably the eye
may be able to estimate more easily the onset of “empty magnification”
due to the use of an objective of unnecessarily high N.A.

(ii.) The angular spread of the diffracted wave fronts may be dependent
upon the height of the step. The difference between simple steps of
7·9 A. and 4·6 A., however, is not likely to be great.

In this case an observing N.A. of 0·28 definitely resulted in a higher
visibility than N.A.’s of 0·45 or greater. The experimental conditions
did not permit the determination of a lower limit to the observing N.A.


The same range of N.A.’s was not available with the phase contrast 
equipment, but the same conclusions as to the observing N.A. apply 
at least qualitatively. 

Thus maximum visibility is obtained by using objectives of medium 
or low N.A. In practice, however, it is often necessary to choose the 
objective primarily for satisfactory lateral resolution of the different 
diffraction patterns and surface detail, medium to high N.A.’s being 
necessary. The best working rule is therefore to compromise by using 
the lowest possible N.A. (i.e. longest possible focal length) consistent 
with satisfactory resolution. Assuming the steps observed on basal faces
to be only 4·6 A. high, then for steps of this, or lesser, height, it may
often be necessary to sacrifice resolution for the sake of satisfactory 
visibility. 


§ 5. COMPARATIVE SENSITIVITIES AND ADVANTAGES OF THE PHASE
CONTRAST AND NARROW PENCIL METHODS.


With the narrow illuminating pencil method a 7·9 A. step is visible
when the surface is unsilvered, but the contrast is very poor indeed.
When the surface is silvered the visibility of such a step increases very
considerably. Fig. 1 a (Pl. XXII.) shows a typical step under these
conditions (the visibility of this step when the surface is unsilvered is too
small for reproduction). When phase contrast equipment is used a 7·9 A.
step on an unsilvered surface is found to have a greater visibility than when
the surface is silvered and the same step observed under narrow pencil
illumination (see fig. 1 b). When the surface is silvered and viewed
under phase contrast conditions a further increase in visibility results
(fig. 1 c).

If it could be assumed that the steps on prism faces of beryl were visible
merely because of being simple steps, then it would appear that the main
advantage of phase contrast methods would lie in the observation of
steps of height less than 7·9 A. For reasons discussed in § 7 and § 8 such
a deduction would not seem to be justified.

While the limit of observable step height is under discussion it is
worthy of mention that for a step of metrical height t the optical height
in reflection is 2μt where μ is the refractive index of the medium. By
using a medium of high refractive index (such as cedar wood oil) in contact
with the silver film it is possible to increase the optical height of a step
considerably. As it is normally desirable to use a small observing N.A.
for steps of low visibility a dry objective must be used. The conditions
for an increase in the optical path can be easily satisfied by allowing a
thin film of cedar wood oil to spread over the surface and then backing
the film of oil with a cover glass. This results in an increase of visibility
of the step. As metallurgical objectives are normally corrected for use
with uncovered specimens some loss of definition must result, but as the
paramount requirement is the observation of small steps this slight loss
of definition can be tolerated.
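To give an idea of the magnitudes involved, the optical height of a uni-molecular step in air and under oil can be evaluated. Two of the numbers below are assumptions not taken from the text: the refractive index of cedar wood oil (taken as 1·515) and the wavelength of the light (taken as the green mercury line, 5461 A.).

```python
def optical_height(t_angstrom: float, medium_index: float = 1.0) -> float:
    """Optical step height in reflection: twice the metrical height
    multiplied by the refractive index of the medium above the surface."""
    return 2.0 * medium_index * t_angstrom


STEP_A = 7.9           # uni-molecular step on a prism face
OIL_INDEX = 1.515      # assumed refractive index of cedar wood oil
WAVELENGTH_A = 5461.0  # assumed wavelength (green mercury line)

in_air = optical_height(STEP_A)
in_oil = optical_height(STEP_A, OIL_INDEX)

print(f"in air: {in_air:.1f} A. = {in_air / WAVELENGTH_A:.4f} wavelength")
print(f"in oil: {in_oil:.1f} A. = {in_oil / WAVELENGTH_A:.4f} wavelength")
```

On these assumptions the oil raises the optical height by about one half, though it remains a very small fraction of a wavelength.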


In addition to the greatly increased contrast in the image the two 
other main practical advantages consequent upon the use of phase 
contrast are : 


(i.) that in many cases it is convenient to work with an unsilvered 
surface. For example hydrated crystals are likely to decompose when 
placed in a vacuum for the purpose of silvering. Moreover with a new 
specimen it is usually necessary to scan the faces for areas of interest 
and a considerable saving in time results if the different faces of each 
crystal do not need to be silvered for such a preliminary survey ; 


(ii.) that the narrower patterns enable closely spaced steps to be resolved
much more easily. Close series of steps which are not resolved with the 
narrow pencil method can be resolved comfortably in phase contrast 
conditions. 


A truly specularly reflecting surface is necessary if phase contrast 
equipment is to function most efficiently. Crystal faces which approach 
this condition are not unduly rare, but very often features of 
interest lie on faces or areas, which have been much roughened by the 
deposition of considerable amounts of foreign material. In such cases 
the increased visibility due to phase contrast may decrease markedly 
and in extreme cases it can become little, if any, better than when the 
same area is viewed using narrow pencil illumination. 


§ 6. CAUSE OF VISIBILITY OF THE STEPS. 


It is surprising that a step of 7·9 A. should produce a diffraction effect
of relatively high visibility, and we shall therefore consider possible 
causes. 

First it is necessary to note that the measurement of step height only
gives the difference in height between two adjacent planes. This does
not mean that the effective step riser is of this height, for it is conceivable
that the original step edge of 7·9 A. height has been modified, by the
deposit of impurity or other causes, to a much higher ridge. However
such a ridge, if present, would have to be of lateral extent less than the
resolving limit of an objective of N.A. 1·32, for no trace of continuous
structure at the step edge can be observed. This could mean that the
“ridge” could only be visible by scattering and one would expect the
intensity of the scattered light to be low. The diffraction pattern has,
however, a high visibility although superimposed on a bright field.
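The lateral scale implied by this argument can be estimated roughly. The sketch below uses the Rayleigh criterion, 0·61λ/N.A., which is our choice of resolution formula and is not stated in the text; the wavelength of 0·55 micron is likewise an assumption.

```python
def rayleigh_limit_micron(wavelength_micron: float, numerical_aperture: float) -> float:
    """Smallest resolvable separation according to the Rayleigh criterion."""
    return 0.61 * wavelength_micron / numerical_aperture


WAVELENGTH_MICRON = 0.55   # assumed wavelength of the illumination
STEP_MICRON = 7.9e-4       # 7.9 A. expressed in microns

limit = rayleigh_limit_micron(WAVELENGTH_MICRON, 1.32)
ratio = limit / STEP_MICRON

print(f"resolving limit at N.A. 1.32 = {limit:.2f} micron")
print(f"limit / step height = {ratio:.0f}")
```

On these assumptions any ridge would have to be narrower than about a quarter of a micron, which is still some hundreds of times the height of the uni-molecular step.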

This line of reasoning appeared to be supported by dark field observations.
In this case only scattering can be operative, but the intensities of the
maxima marking the positions of the steps are very low. For example,
a step which required an exposure time of very few seconds with one of
the bright field methods required about 15 minutes in dark field conditions,
even although in the latter case the actual intensity of illumination was
much greater.


The possibility of diffraction at a simple step was therefore considered.
It was first shown by Rubinowicz (1917), starting from the ordinary
Kirchhoff diffraction integral, that there are two alternative ways of
considering diffraction by a plane screen. One can either (a) consider
the contribution to the amplitude from every point on the screen, or
(b) consider only the edge of the screen as effective, so that the resultant
amplitude consists of the sum of the contributions predicted by geometric
optics, plus the effect of “boundary waves”. The latter are spherical
waves which behave as if they emanate from a source at the edge of the 
screen. 

The very fact that the 7·9 A. steps are uni-molecular means that the
planes between risers are true planes and that the riser is a true discon- 
tinuity. Even allowing for a good deal of smoothing out of the riser by 
the deposited silver it must still act as a true discontinuity to the incident 
light. The conditions are therefore ideal and any diffraction effect should 
produce the maximum possible intensities. This does not however 
necessarily account for the observed high visibilities. 


§ 7. VARIATIONS IN VISIBILITY OF STEPS OF THE SAME HEIGHT. 


Considerable variations in the visibility of 7·9 A. steps occur and it
should be possible to explain these variations in terms of the mechanism 
which renders the steps visible. 


(a) Effect of visible deposits of foreign material. 

The deposits may consist of (a) a general point speckling of the surface 
(see, e.g. Pl. XXII., fig. 1). The visibility of steps is normally high in such
regions ; (b) larger scale markings which often take various distinctive 
forms. One such type of marking results in the underlying steps being 
only faintly visible or invisible, while the continuation of the step outside 
of this area has a high visibility. 

From the viewpoint presented above this could be due to poor contouring 
of the step, resulting in a decreased effective step height, or a non-plane 
deposit of the material, so that the diffraction conditions are altered, 
or possibly both effects might be operative together. Complete invisi- 
bility might be due to these causes carried to extremes. 

The other types of large scale marking, when densely distributed, are 
associated with a considerable decrease in visibility and the step structure 
of large areas, or even whole faces, may not be seen. These large scale 
markings are associated with less obvious “blotchy” deposits which 
cover most of the available area. An explanation can therefore be offered 
in terms of a marked decrease in the plane diffracting areas. 


(b) Sudden increases in visibility. 
A very marked effect of this kind is shown in the triangular area in
Pl. XXII., fig. 2. Neighbouring areas without well defined limits also show
an increased visibility, and other triangular areas of greatly increased
visibility also exist elsewhere on this face. It appears that such behaviour
could only be due to a greatly increased step height in this area. This 
view is supported by dark field observations in which the intensity 
scattered by the “super-visible” steps is much greater than that scattered
by the steps in adjacent areas. 

If this increase of visibility is attributed to an increase of height of 
simple steps, we have the problem of explaining how steps of great 
height come to be continuous in line with steps of smaller height. It has 
not been found possible to explain satisfactorily how such steps could be 
produced. For example, it appears that none of the normal processes 
of crystal growth would produce such a structure (Frank 1949, Burton, 
Cabrera and Frank 1951), and it is difficult to consider this as a crystalline 
overgrowth. 

If, however, the increased visibility is considered to be due to the
presence of a high “ridge”, the formation of such a “ridge” can be
satisfactorily accounted for, but one must accept that the mechanism
operating in this small area must also be operative in the adjacent
areas, the sole difference presumably being a difference in dimension of
the “ridge”. It is to be noted that the same type of relationship between
the intensities in dark and bright field holds for the triangular area as 
for other areas. 


(c) Slow variations in visibility. 

In many cases a slow decrease of the visibility of steps takes place over 
comparatively large areas, the visibility eventually becoming very low. 
These variations are observed whether the silver film is present or not. 
All the evidence is in favour of the steps being of uniform height, so that 
changes in the state of the crystal surface must be responsible for these 
changes. 

It is probably a significant observation that in the low visibility areas
there are very few deposits of foreign material to be seen, sometimes none,
whereas they are present in all areas of high visibility.


§ 8. VISIBILITY OF STEPS ON OTHER TYPES OF CRYSTALS. 


Investigation of the step structure on cleavage faces of mica by A. J.
Forty (1950), using phase contrast illumination, revealed steps which were
probably 20 A. high, corresponding to the height of the unit cell. The 
visibility of these steps was very low despite the large step height, but it 
appeared that these observations might be reconciled with those on 
beryl, because of the different surface structure in the two cases. As is 
well known cleavage slips of mica crumple, because of lattice stresses, so 
that the area between steps is usually by no means plane, and there is often 
a total variation in height of several wavelengths. 

Later work on crystals of silicon carbide by A. R. Verma (1951) showed 
that steps of 15 A. (corresponding to the height of the c-axis of the unit 
cell of type 6H) had a consistently very low visibility when viewed under
phase contrast illumination. In this case the area between steps is
perfectly plane and it does not appear that any kind of anomalous
behaviour can be taking place.

This evidence, and the observation of “super-visibility” areas on
beryl, appear to be sufficient to discredit the hypothesis of simple edge
diffraction for the areas of high visibility.


§ 9. PRESENCE OF “CLUMPS” OF MATERIAL AT STEP EDGES ON BERYL.


That steps should only have high visibility in areas where deposited 
impurities can be seen on the surface supports the conclusion that there 
must be deposited material along the step edge. Moreover the edge 
pattern always appears spotty, and not truly continuous, when narrow 
pencil illumination is used. Dark field observations usually show an edge 
as defined by a line of discrete dots, sometimes with considerable gaps. 
It therefore appears that the irregular edge patterns are not caused by 
impurity overlying a “ridge” and therefore blocking out sections of the
pattern, but rather that discrete “clumps” of material are present.
A puzzling feature was that the spottiness of the edges was considerably 
reduced when phase contrast illumination was used and photographs 
were taken in the usual way. The resolving power in such conditions 
should be considerably higher than with the narrow pencil method. 
However, by appropriate choice of exposure time it was found that the 
spottiness of the edge pattern under phase contrast conditions could be 
very obvious, and that instead of a continuous minimum some edges 
could be observed as lines of discrete, barely resolved spots. 

The low intensities of the dark field images can be ascribed to the 
“clumps” being shallow, but of sufficient lateral extension to be resolved,
or almost so. Moreover if the clumps are some 200-300 A. high, then this 
height difference alone could account for a good deal of the deepening 
of the minimum in phase contrast conditions. 

It is of interest to note that high visibility step patterns have been 
observed on several crystals of beryl which came from very different 
sources, so that a deposition of clumps of material along step edges is 
probably quite a common phenomenon. Their chemical nature might 
be investigated by chemical attack, but this is not permissible with the 
specimens at present available. 

The non-complementary intensity distribution on either side of the 
true focus is rather puzzling in the light of this explanation. It might 
have been caused by a deviation from axiality which was not detected : 
it may be due to combination of a weak simple step-diffraction pattern 
with the scattering from clumps : or it may be that the clumps differ in 
structure on the sides against and away from the steps. 

In terms of the “clump” explanation the areas of very low visibility
may be made visible entirely by a simple step diffraction effect. The
spottiness of the patterns in such areas is much reduced and no deposited 
impurity is normally to be seen on the surface. 


The precise form of the “clumps” at step edges should be revealed
very easily by an electron microscope, and it is intended to make such
observations.


§ 10. EFFECT OF STEP HEIGHT AND NATURE.


On faces of some crystals it is found that the visibility of multi-molecular 
steps is less than that of small steps. This apparently paradoxical situation 
is due to the nature of the multi-molecular steps. The growth layers on 
a crystal face are commonly uni-molecular when first formed and it is 
necessary for the formation of multi-molecular layers that the unit layers 
draw together for some reason. This behaviour can take various forms 
and may apparently be due to various causes ; the phenomenon is described
by the generic term “bunching”.

We are concerned here with bunching only as it affects experimental 
observations. An example is shown in Pl. XXII., fig. 3 in which the unit
steps, although drawn together, remain quite widely spaced on a molecular
scale. This is an area of major rhombohedral face (1011) of a quartz
crystal. The steps are approximately 200 A. high and yet their visibility
is poor when examined with narrow pencil illumination as in this figure.
Interferometric measurements showed that the risers of these steps make
an angle with the planes of approximately 0.5°. This angle of inclination
and step height was roughly maintained over almost the whole of the face, 
so that it is understandable that risers some 3 microns in lateral extent 
should produce a diffraction effect of low visibility. 
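
The lateral extent of such a riser follows from simple trigonometry. The sketch below is only an illustration: the function name is hypothetical, and it assumes the riser is a plane facet inclined at the quoted angle to the face. With the quoted 200 A. and 0.5° it gives a little over 2 microns, the same order as the figure of some 3 microns given in the text.

```python
import math

def riser_lateral_extent_um(step_height_angstrom, riser_angle_deg):
    """Horizontal extent (microns) of a plane riser of given height and inclination.

    Assumes a plane facet, so that extent = height / tan(angle).
    """
    extent_angstrom = step_height_angstrom / math.tan(math.radians(riser_angle_deg))
    return extent_angstrom * 1e-4  # 1 A. = 1e-4 micron

# Values quoted in the text for the quartz rhombohedral face.
extent = riser_lateral_extent_um(200.0, 0.5)
print(f"lateral extent of riser: {extent:.2f} microns")
```

A riser thousands of times wider than it is high behaves more like a gentle slope than a discontinuity, which is consistent with the low visibility reported.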

A case where bunching greatly increases the visibility is shown in 
Pl. XXIII, fig. 4. This is part of a terminal face (0001) of beryl and it 
appears that the multi-molecular steps make a steep angle with the plane of 
the face. Consequently the step edge still acts as a true discontinuity and 
the greater step height therefore results in increased visibility. (The 
general difference in intensity between neighbouring areas in fig. 4 is, of 
course, because the step height is great enough for phase differences to 
be shown as intensity differences.) 

It appears possible that all gradations of bunching may exist, from 
bunches making very small angles with the face to bunches perpendicular 
to the face. The main experimental significance to be attached to the 
observation of low angle bunching of the quartz type is that if any one 
good crystal shows this kind of behaviour consistently it may well be 
that the great majority of all crystals of this substance will behave in 
this way. As the most detailed information about growth processes 
can only be obtained by observation of uni-molecular steps it is therefore 
advisable to avoid such substances when possible. For example exami- 
nation of some 50 faces of various forms on quartz crystals by B.T.M. 
Willis (private communication) has shown that in every case when steps 
were visible a considerable degree of bunching had taken place, the 
bunching normally being of the low angle type. 




Ionic crystals appear to be particularly susceptible to the onset of 
bunching, but from a limited amount of work on diamond crystals, for 
example, it appears that steps on other types of crystals may normally 
be multi-molecular. 


ACKNOWLEDGMENTS. 


I am grateful to Dr. F. C. Frank and B. T. M. Willis for much helpful 
discussion on the visibility of the steps, and to Professor S. Tolansky
in whose laboratory the early part of this work was carried out. I am
glad to express my thanks to the Department of Scientific and Industrial 
Research, for financial support during all phases of this work and to the 
Keeper and staff of the Mineralogical Department, British Museum for the 
loan of specimens. 

REFERENCES. 
Barakat, N., 1949, Nature, Lond., 16, 603.
Bragg, W. L., and West, J., 1926, Proc. Roy. Soc. A, 111, 691.
Burton, K., Cabrera, N., and Frank, F. C., 1951, Phil. Trans. Roy. Soc.,
in press.
Forty, A. J., 1950, private communication.
Frank, F. C., 1949, Discussions Farad. Soc., 5, 48.
Griffin, L. J., 1950, Phil. Mag., 41, 196 ; 1951, Ibid., in press.
Rubinowicz, A., 1917, Ann. Phys. Lpz., 58, 257.
Verma, A. R., 1951, Phil. Mag., in press.


[Plate XXII. L. J. Griffin, Phil. Mag. Ser. 7, Vol. 42. To face page 786.]

[Plate XXIII. L. J. Griffin, Phil. Mag. Ser. 7, Vol. 42.]


LXXX. A Design of an Ultrasonic Delay-Line. 


By T. GOLD,


Cavendish Laboratory, Cambridge*. 
[Received March 29, 1951.]


SUMMARY. 


A simple design of a vertical folded mercury delay-line is described. 
The device is used as a memory unit for electronic apparatus, where it 
introduces a delay of 1.2 milliseconds with a band-width of several
megacycles per second.

Design considerations and essential details of construction are given, 
and the operating conditions are mentioned. 


§ 1. INTRODUCTION.


THE device described here is an ultrasonic delay-line, designed for delay 
or storage of electric pulses. Its present application is in the Pulse 
Analyser described in another paper (Hutchinson and Scarrott 1951). 

The present design is in many respects similar to that which was 
suggested by the author and adopted for the EDSAC (the high-speed 
digital computing machine at the Mathematical Laboratory, Cambridge). 
As usual it employs mercury to propagate the soundwaves, and X-cut 
quartz plates as electro-mechanical transducers. The delay is approximately
1.2 milliseconds, and the necessary path length of five feet is
accommodated in an object of roughly half that length. The necessary 
reflection is made by means of a perspex corner reflector, possessing three 
mutually perpendicular surfaces. 

This design of a “folded line” reduces the overall length, allows one to
mount the device vertically without prejudicing the reliability, and 
obviates the necessity of precise alignment of the structures at the two ends. 


§ 2. DESIGN CONSIDERATIONS. 


(1) Transmission medium. 

The acoustic properties required of the transmission liquid (solids are 
less suitable) are a low velocity of sound, a low attenuation per unit time, 
and an acoustic impedance similar to that of the electro-mechanical transducers
used. X-cut quartz is the most suitable material for transducers, and its
acoustic impedance (density × velocity of sound) is $1.52 \times 10^{6}$ in c.g.s. units.
Clearly a very dense liquid will be required ; and mercury with an acoustic
impedance of $1.93 \times 10^{6}$ is satisfactory in all respects.
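
The quality of this match can be put in numbers with the standard textbook expression for the fraction of acoustic power reflected at normal incidence between two media, $((Z_2 - Z_1)/(Z_2 + Z_1))^2$. This relation is not quoted in the paper; it is used here as an assumption about plane waves at a plane boundary. The impedance of water (about $1.5 \times 10^{5}$ c.g.s.) is likewise an outside figure, included only for comparison.

```python
def intensity_reflection(z1, z2):
    """Fraction of incident acoustic power reflected at a plane boundary
    between media of impedance z1 and z2 (normal incidence, plane waves)."""
    return ((z2 - z1) / (z2 + z1)) ** 2

Z_QUARTZ = 1.52e6   # c.g.s., value quoted in the text
Z_MERCURY = 1.93e6  # c.g.s., value quoted in the text
Z_WATER = 1.5e5     # c.g.s., assumption for comparison only

r_mercury = intensity_reflection(Z_QUARTZ, Z_MERCURY)
r_water = intensity_reflection(Z_QUARTZ, Z_WATER)

print(f"quartz to mercury: {100 * r_mercury:.1f} per cent reflected")
print(f"quartz to water:   {100 * r_water:.1f} per cent reflected")
```

On these figures only about one and a half per cent of the power is reflected at a quartz-mercury boundary, whereas a light liquid would reflect most of it.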


*Communicated by Professor O. R. Frisch, F.R.S. 


(2) Quartz mounting. 

A mounting in which only one side of the quartz plate is acoustically 
loaded appears to be the most satisfactory. A slight increase in the 
transducer bandwidth could be obtained by a double sided loading ; 
but as this bandwidth in the present case is approximately 8 Mc./sec. 
(about a frequency of 13 Mc./sec.), no such increase is required for most 
purposes. 

The acoustic mismatch from quartz to air is so great, that an airgap of 
as little as $10^{-5}$ cm. is practically a complete barrier at frequencies of the
order of 10 Mc./sec. Consequently a thin quartz plate can be robustly
supported by being pressed firmly onto a smooth, flat, dry backing plate ; 
the very thin air gap which remains when normal engineering methods are 
employed, will still prevent any appreciable acoustic transmission to the 
back. 

In the present design the thin quartz plates (+ mm. thick) (see fig. 1 (k))
are pressed onto the flat backing plate (l, m, n) by rubber rings which act
as mercury seals (j). Under these conditions the quartz plates will 
withstand several atmospheres of hydrostatic pressure, and the line can 
hence be mounted vertically with the quartz at the bottom. 


(3) The acoustic channel. 

The tube should be wide enough so that (a) the sound waves reflected 
from the walls form a negligible contribution, (b) no waves travelling in the 
wall material are set up. The dimensions chosen here, of an internal tube 
diameter of one inch and a diameter of the active area of the quartz of 
2 of an inch (brass electrode n) have been found empirically to avoid 
such disturbances: but it is known that a considerable departure from 
these dimensions can be tolerated. In particular, smaller dimensions
may be desirable to economize mercury.


(4) The corner reflector.

The reflector consists of three mutually perpendicular reflecting surfaces, 
arranged like the corner of a cube, viewed from the inside. The surfaces 
must possess a high sonic reflection coefficient in mercury ; perspex is 
suitable for this, and is also easy to shape. 

Such a corner reflector has the convenient property that incident and 
reflected rays are parallel to each other and symmetrically placed on either 
side of a line parallel to the rays and passing through the point of the 
corner. The reflection is hence unaffected by a rotation about that point, 
so long as the ray is reflected by each of the three surfaces. 
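
The direction-reversing property can be checked numerically. The following is a minimal sketch, not part of the original design work: it treats the sound as a geometrical ray, assumes the ray strikes all three faces, and follows only its direction (the lateral displacement of the returning ray is not modelled). All function names are illustrative.

```python
import math

def reflect(v, n):
    """Mirror the direction vector v in a plane whose unit normal is n."""
    d = sum(vi * ni for vi, ni in zip(v, n))
    return tuple(vi - 2.0 * d * ni for vi, ni in zip(v, n))

def rot_z(v, a):
    """Rotate v by angle a (radians) about the z axis."""
    x, y, z = v
    return (x * math.cos(a) - y * math.sin(a), x * math.sin(a) + y * math.cos(a), z)

def rot_x(v, b):
    """Rotate v by angle b (radians) about the x axis."""
    x, y, z = v
    return (x, y * math.cos(b) - z * math.sin(b), y * math.sin(b) + z * math.cos(b))

def corner_normals(a, b):
    """Unit normals of three mutually perpendicular faces after tilting the corner."""
    basis = [(1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0)]
    return [rot_x(rot_z(e, a), b) for e in basis]

def corner_reflect(v, normals):
    """Reflect the ray direction v once from each of the three faces."""
    for n in normals:
        v = reflect(v, n)
    return v

incoming = (0.1, -0.2, -1.0)
outgoing_by_tilt = {
    tilt: corner_reflect(incoming, corner_normals(tilt, 0.5 * tilt))
    for tilt in (0.0, 0.05, 0.3)
}
for tilt, out in outgoing_by_tilt.items():
    print(tilt, [round(c, 12) for c in out])
```

Whatever tilt is given to the corner, the outgoing direction is the exact reverse of the incoming one, which is why the reflector itself need not be accurately aligned with the rest of the line.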

It is required that the wavefront arriving at the receiving quartz shall 
be parallel to it to a precision of a fraction of a wavelength over the active 
area. For the highest frequency components which may be utilized, say 
17 Mc./sec., and using the criterion of a quarter wavelength permissible
error, this would imply an angular precision of $1.3 \times 10^{-3}$ radian. It is not
easy to construct a line so that the two ends are parallel to such a precision,
and structures of great rigidity are then required. The consequence of a
greater angular error would initially be merely to decrease the high
frequency response; but errors of more than $5 \times 10^{-3}$ radian would
seriously impede the function over the entire frequency band.
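
The order of magnitude of this tolerance can be reproduced with a short calculation. The sketch below rests on two assumptions that are not stated in the paper: a velocity of sound in mercury of about 1450 m/sec., and an active diameter of 16 mm., which is a hypothetical value chosen purely for illustration. The tolerance is taken as the permitted path error (a quarter wavelength) divided by the active diameter.

```python
def angular_tolerance(freq_hz, sound_speed_m_s, active_diameter_m, fraction=0.25):
    """Permissible tilt (radians) between wavefront and receiving plate.

    The path error across the active diameter is limited to the given
    fraction of a wavelength (a quarter wavelength by default).
    """
    wavelength = sound_speed_m_s / freq_hz
    return fraction * wavelength / active_diameter_m

SOUND_SPEED_MERCURY = 1450.0  # m/sec., assumption (not given in the paper)
ACTIVE_DIAMETER = 0.016       # m, hypothetical value for illustration

tol = angular_tolerance(17e6, SOUND_SPEED_MERCURY, ACTIVE_DIAMETER)
print(f"permissible angular error: {tol:.2e} radian")
```

With these assumed figures the result is close to one part in a thousand of a radian, in agreement with the precision quoted above.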

In the present design these requirements are much easier to satisfy, 
for it is only necessary that the two quartz mountings, which are side by 
side, shall be parallel to this precision, and that the corner reflector shall be 
accurate. But owing to the fortunate property of a corner reflector, no 
great angular precision need be carried from one end of the object to the 
other. 


Fig. 1. 


Elevation of delay line. (a) Filling plug with “breathing” hole: Perspex ;
(b) Corner reflector : made of Perspex in several pieces, glued with perspex
solvent ; (c) Steel plate ; (d) Rubber rings, making the seal between the
tubes and the reflector assembly ; (e) Steel rings, attached to (c) with four
screws each; (f) Steel tubes, 1 inch internal diameter; (g) Rubber rings,
as (d); (h) Steel rings, attached by four screws each to plate (l), through
clearance holes in plate (i); (i) Steel plate, attached with screws to (l) ;
(j) Rubber rings, making the seal between (h) and the quartz plates (k) ;
(k) Quartz plates, x-cut, 13 Mc./sec. resonant frequency, one inch diameter ;
(l) Steel backing plate ; (m) Tufnol insets ; (n) Brass electrodes: (l, m, n,
o, p) are assembled before surface grinding; (o) Tufnol washer ; (p)
soldering tag. The girder linking plates (c) and (l) is not shown.




The less stringent precision requirements of the positioning of the point 
of the corner (which should be half way between the normals to the centres 
of the quartz plates) and of the tubes can be met easily, and do not warrant 
discussion. 


§ 3. DETAILS OF CONSTRUCTION AND FILLING.


It has been found convenient not to rely upon the tubes themselves for 
rigidity, but to mount them in a U-section steel girder. The mercury seals 
between the tubes and the end assemblies may then be made by rubber 
rings (d and g) which are compressed by about 20 per cent as the endplates 
are tightened onto the ends of the girder. (This girder is not shown in 
fig. 1; it connects plates (l) and (c).)


Fig. 2. 


Plan view of the corner reflector and top assembly. 


The insets in the backing plate (m and n) are glued and screwed into
position before the entire surface is ground flat. The plate (i) and the
rings (h) will allow the quartz plates to be fitted independently of the rest
of the assembly. The rubber rings which bear on the quartz surface (j)
are compressed by 15 to 20 per cent when the clamping rings are screwed
down tight onto the plate (i).

The only known causes of subsequent failure of a mercury delay-line are 
leakage, fracture of quartz plates, or the adhesion of gas to the quartz 
surface. Leakage is avoided by using high grade rubber or neoprene 
gaskets cut out of sheet to assure flatness, and by care in adjusting dimen- 
sions to give the correct compression. Fracture of the quartz after 
assembly may result from insufficiently flat or subsequently distorted 
backing plates; one has to use a sufficiently thick plate and carefully
grind the surface after the insets have been fully tightened and allowed to
“settle down”.




To avoid the adhesion of very small quantities of gas to the quartz 
surface, which may be sufficient to impede the acoustic transmission, it is 
advisable to have all internal surfaces very clean before filling, and to use 
mercury free from impurities which cause scum. For the same reason it is
essential to fill the line first with a liquid which wets all surfaces well and 
thus excludes air, and to replace this liquid directly with the mercury. 
Alcohol is suitable for this. The present design, with the quartz at the 
bottom, avoids the risk that small quantities of gas migrate there later ; 
a risk which exists for horizontal lines, and which would presumably be 
severe in any design where a quartz plate faces downwards. 


Fig. 3. 


Plan view of quartz mountings. 


§ 4. OPERATING CONDITIONS. 

The mercury itself acts as the earthed electrode. The input and output 
are hence applied to the brass electrodes, whose frontal area determines the 
active area of the quartz, and also the input and output capacity. In the 
present case, with 13 Mc./sec. x-cut quartz plates this capacity is approxi- 
mately 30 pf. Owing to the small value of the piezo-electric coefficient 
and the good acoustic match, other terms in the input impedance can 
usually be ignored for the purpose of circuit design, and the problem is then 
simply to apply the desired waveform across this capacity. At the 
output an increase of voltage can be obtained, at the expense of bandwidth, 
by tuning with an inductance ; or alternatively a connection may be made 
simply to the grid and gridleak of a valve. 

The output voltage, when feeding into the small capacity of the grid of a
valve, is approximately one thousandth of the voltage applied at the input. 
As the system cannot in any case pass low frequencies, the output is 
usually taken to a tuned amplifier. 


REFERENCES. 


Hutchinson, G. W., and Scarrott, G. G., 1951, Phil. Mag., 42, 792.
Wilkes, M. V., and Renwick, W., 1948, El. Engin., 20, 208.




LXXXI. A High Precision Pulse Height Analyser of Moderately High Speed. 


By G. W. HUTCHINSON, M.A., and G. G. SCARROTT,


Cavendish Laboratory, Cambridge*.


[Received February 26, 1951.] 


SUMMARY. 


A pulse height analyser is described suited particularly to the 
requirements of nuclear physics. Information is stored in a supersonic 
delay line and displayed on a cathode ray tube. The accuracy of channel 
width and the linearity of response are both one part in a thousand. 
The number of channels and storage capacity per channel may be varied 
at will to give 60 channels of up to $10^{6}$ pulses, 80 channels of up to
$3 \times 10^{4}$ pulses, or 120 channels of up to 1023 pulses. The maximum
counting rate is 1600 per second. 


A PULSE analyser serves to sort electrical pulses according to their peak 
voltage. The present analyser, in common with most of the other types 
which have been described, does this by displaying the number of pulses 
which have been received during an experiment within each of a series 
of size ranges; the boundaries between the ranges, or channels, are 
equally spaced in voltage. 

The optimum number of channels is often determined by the experi- 
mental scatter of pulse size or by the need to present the information 
in an easily digestible form, that is, as a number of separate data which
is not too large to be readily analysed. Apart from such limitations an 
analyser should lose as little information as possible in the presentation 
of a result. This would be achieved by a kicksorter with a large number 
of channels—about equal to the total number of pulses to be analysed— 
and a small capacity per channel. In the extreme case with an infinite 
number of channels and a capacity of one in each channel, none of the 
information would be lost. 

These ideal requirements are modified when we consider the effects 
of channel boundary drifts. One must distinguish two kinds of drift— 
common drift which causes an uncertain zero error, and relative drift 
which causes uncertainty in the channel width. The common drift 
then places a lower limit to the useful width of each channel, i.e. if the
common drift is, say, 0.2 V., then no advantage can be gained by making
channels much narrower than 0.2 V.

* Communicated by Professor O. R. Frisch, F.R.S.

The effect of relative drift is much more serious. Suppose the channel
width is v and the counting capacity per channel is n, then unless the
probable error in channel width is less than $v/\sqrt{n}$ the full statistical
significance of the channel counting capacity cannot be realized. This
means that if the channel width be made 0.2 V. and n has the modest
value of $10^{4}$, then the channel boundaries must be fixed relative to one
another with a precision of 2 mV.
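
The figure of 2 mV. follows directly from the criterion $v/\sqrt{n}$. The short sketch below merely evaluates that expression; the function name is illustrative, and the second call uses an assumed capacity of $10^{6}$ to show how quickly the requirement tightens.

```python
import math

def max_width_error(channel_width_v, capacity):
    """Largest tolerable error in channel width, v / sqrt(n), in volts."""
    return channel_width_v / math.sqrt(capacity)

# Values from the text: v = 0.2 V., n = 10**4.
err = max_width_error(0.2, 10**4)
print(f"required precision: {1000 * err:.1f} mV")

# Assumed larger capacity, for comparison only.
err_large = max_width_error(0.2, 10**6)
print(f"with n = 10**6:     {1000 * err_large:.1f} mV")
```

A hundredfold increase in counting capacity thus demands a tenfold improvement in the stability of the channel boundaries.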


§ 1.

Several authors (see References) have described electrical pulse height 
analysers for use in nuclear physics research. Most of these either 
are slow because they employ a mechanical system to record each pulse, 
or they use very large numbers of components to provide a scaling circuit 
before each of many mechanical recorders. In addition, many, but not 
all of them divide the spectrum of pulses into discrete channels whose 
boundaries are determined separately by voltages applied to electrodes 
of different valves. The relative channel boundary drift is therefore as 
large as the common boundary drift, so that comparatively few large 
channels can be usefully employed.

The pulse spectrograph (Maeder et al. 1947) suffers from neither of 
these disadvantages, but since it presents its information only as a 
diagram it is not very suitable for work in which precise knowledge of 
the number of pulses in a given spectral interval is required. It has also 
been found liable to serious error due to irregularities in the fluorescent 
screen of cathode ray tubes. 

The analyser designed by D. H. Wilkinson (1949), while limited in 
speed by mechanical recorders, avoided relative boundary drift by 
gating an oscillator for a time proportional to the height of the pulse and 
then determining the appropriate channel by counting the oscillations 
during this time. This meant that, in effect, beginning from the random 
instant of arrival of the pulse the various channels were presented in turn 
using a common mechanism for determining all the channel boundaries. 

In the present analyser the channels are presented in turn in a regular 
cycle, irrespective of the random time of arrival of pulses, the comparison 
mechanism being again common to all the channel boundaries. Again 
the definition of the relative channel widths depends on a time measure- 
ment which can be very accurate, and on the linearity of a saw-tooth 
waveform. This comparison with a regularly recurring waveform allows 
the use of a supersonic delay line memory system to store the information, 
and hence the combination of high precision with a much higher speed 
and considerable economy in components. 


§ 2. FUNCTIONAL ARRANGEMENT.


The desirable functions of a pulse analyser may be divided up as
shown in fig. 1.

The memory and display are often the same, but functionally they are 
distinct. The gate is needed unless the rate of presentation of pulses
to the machine is very slow compared with the time taken to record a
pulse. It must be shut as soon as a pulse has been accepted for recording
and reopened only after the sorting system has been cleared, otherwise 
more than one pulse may be presented at the same time to the sorter 
and the spectrum will be distorted in a way determined by the nature of 
the sorter (e.g. the smaller pulses may be masked by the larger). In 
order to reduce the loss while the gate is shut, the time taken to record 
a pulse should be as small as possible. 

It is not in general required to study a complete spectrum of pulse size. 
The lower end of the required spectrum may be limited by noise or may 
be arbitrarily set at a higher value. The ratio of the lower and upper 
end is also arbitrary. The function of the selector is to allow only those 
pulses to operate the sorting and gate mechanism which, after passing 
through the biased amplifier, lie within a specified voltage range. The 
limits of this range, when referred to the input of the analyser, may be 
made to agree with those of the required spectrum by the bias and gain 
controls of the biased amplifier. If a linear scale is required, the selector
lower limit must be set at such a level that pulses within the range of 
the finite non-linearity of the biased amplifier are rejected. The presence 
of the defined upper limit means that the analyser does not waste time 
by trying to sort pulses which are larger than the required range. 


Fig. 1.

[Block diagram of the analyser. Labels recoverable from the figure: preamplifier, biased amplifier, selector, sorter, memory, display.]


To increase the accuracy of the machine the voltage level of the pulses 
during the passage through the biased amplifier and sorter should be as 
high as possible (up to 200 volts in the present case). The pulses enter 
the machine usually from a standard pulse amplifier producing pulses 
only up to 50 volts, so it is also desirable to insert a “preamplifier”
before the biased amplifier. 

Finally, it is useful to have a source of standard and variable pulses 
available for calibration either by insertion at a 50 volt level directly 
into the analyser, or at a level of millivolts into a standard condenser 
connected to the input of the external amplifying system. 

All these functions are incorporated in the present analyser, and will 
be described separately. 


§ 3. ACCURACY AND STABILITY.


The standard voltages with which the voltage levels of the pulses are 
compared are derived ultimately from a stabilized power supply of 
conventional series-parallel design. This in turn derives its standard 
of voltage from a neon stabilizer, type 85A1, whose stability is stated 
by its makers to be one part in 1000. The circuits of the analyser have 
been designed to have this same accuracy both in channel width and in 
linearity of amplifiers and sorting circuits. 




The variations in the effective cathode potentials of valves are known 
(Gray 1948) to be of the order of fractions of a volt. If drifts were not 
to exceed 0.1 per cent of 200 V., the voltage level throughout the analyser,
it was necessary to arrange that variations within the valves should 
cancel to the first order. Whenever possible the two halves of one double 
triode have been used in this cancellation. 

As an independent check, the calibration of the test pulses is made to 
depend in the first instance only on a wire-wound resistance and a 
sub-standard milli-ammeter, and their stability on that of a dry battery.


§ 4. MEMORY.

A mercury supersonic delay line (Harvard Annals) is used to
store information. To save space, a folded vertical line designed by
Mr. T. Gold was used. Impulses are fed to one end of the line by a quartz
crystal and are detected after a time $T_1$, which is about 1.2 milliseconds,
by a similar quartz crystal at the other end. The electrical impulses
from the detector crystal are amplified, reshaped, altered if required,
and fed again to the transmitting end of the line. Once a particular
time sequence of impulses has been set up in the system it will therefore
circulate indefinitely unless it is deliberately altered by an adding gate.


Fig. 2.

[Block diagram of the memory system. Labels recoverable from the figure: transmitter, delay line, receiver, coincidence, add signal, adding gate, digit time pulse generator, digit frequency pulsed oscillator, channel frequency pulsed oscillator, linear time sweep, amplitude discriminator.]

The cycle is synchronized with the comparison system in the following
way. While the trigger pair of fig. 2 is in one state A the two pulsed
oscillators and the linear time base are free. While it is in the other
state B, they are held fixed. The change from B to A injects a pulse
into the delay line through the adding gate. The discriminator is arranged
at such a level and the linear time base at such a speed that the trigger
pair is allowed to remain in the state A only for a time $T_2$, which is a little
less than $T_1$. The trigger pair is A.C. coupled with such time constants
that, if running free, it would remain in the state B for a time $T_3$, where
$T_3 > T_1 - T_2$.

[Fig. 2, further labels recoverable from the figure: injection trigger (B to A), trigger pair, channel time pulse generator, comparison.]


When the instrument is switched on there are no pulses in the delay
line. The first transit from B to A produces a pulse (waveform a of
fig. 3) which reappears from the amplifier after a time T₁. At this time the
trigger pair is in the state B, and it is therefore forcibly triggered into
the state A. The cycle is then repeated with the period T₁ so long as the
condition T₃ > T₁ − T₂ > 0 is fulfilled. Since T₃ can be large, this means
that small drifts in T₂ or in T₁, which varies slightly with temperature,
may be tolerated without destroying the synchronism.
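
The synchronizing condition may be illustrated by a short numerical sketch in Python. It is not part of the original description: the function names and the simplified model, in which a single pulse is followed round the line, are assumptions made for illustration. Times are in microseconds; T₁ = 1200 follows the text, while the values of T₂ and T₃ are invented.

```python
def is_synchronized(t1, t2, t3):
    """Condition quoted in the text: T3 > T1 - T2 > 0."""
    return t3 > t1 - t2 > 0


def transition_times(t1, t2, t3, cycles=4):
    """Times of successive B-to-A transitions of the trigger pair.

    Simplified model (an assumption): the pair stays in A for t2, then in B
    until either the circulating pulse returns (t1 after injection) or the
    free-running time t3 expires, whichever comes first.
    """
    times = [0]  # the first B-to-A transition at switch-on injects a pulse
    for _ in range(cycles):
        start = times[-1]
        pulse_back = start + t1      # pulse reappears from the amplifier
        free_run = start + t2 + t3   # pair would return to A unaided
        if t1 <= t2:
            # pulse returns while the pair is still in A and has no effect
            times.append(free_run)
        else:
            times.append(min(pulse_back, free_run))
    return times


# T1 = 1200 from the text; T2 and T3 are illustrative guesses.
print(is_synchronized(1200, 1190, 500), transition_times(1200, 1190, 500))
print(is_synchronized(1200, 1190, 5), transition_times(1200, 1190, 5))
```

In the first case the period equals T₁ and the cycle is locked to the line; in the second, T₃ is too short, the pair returns to A before the pulse has reappeared, and the period is no longer that of the delay line.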

The waveforms from the linear time base, the trigger pair and the two
pulse generators are as shown at b, c, d and e in fig. 3.


[Fig. 3. Waveforms a to e, with the intervals T₁ and T₂ marked.]


Any signal occurring during the interval T₂ after the initiating signal
arrives when the trigger pair is already in state A and therefore has no
effect on the cycle. This period is divided into about 1200 equally spaced
possible signal positions by the waveform d (short pulses spaced 1 μsec.
apart). Signals occurring during the period (T₁ − T₂) before an initiating
signal would upset the synchronism, but there is no mechanism by which
they are ever produced.

Waveform e divides the signal positions into equal groups of n. With
each group is associated a range of voltage, i.e. that traversed by the
waveform b during the passage of the preceding group through the
adding gate. The groups therefore define the channels of the pulse
analyser and the pattern of signals present in each group represents the
number of pulses received whose heights lay within the associated
voltage range. By using the presence and absence of signals to represent
the digits 1 and 0 of the number written in the binary system, any number
up to (2ⁿ − 1) can be recorded.
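
The representation of a channel's contents may be sketched as follows in Python. The function names are hypothetical, and the ordering (first position of the group standing for 2⁰) is taken from the description of the sorter in §6.

```python
def encode_count(count, n):
    """Pattern of n signal positions for a count, first position = 2**0.

    1 stands for a signal present, 0 for a signal absent.
    """
    if not 0 <= count <= 2**n - 1:
        raise ValueError("count does not fit into n positions")
    return [(count >> k) & 1 for k in range(n)]


def decode_count(digits):
    """Number represented by a pattern of signals, first position = 2**0."""
    return sum(d << k for k, d in enumerate(digits))


print(encode_count(5, 4))                    # 5 = 1 + 4
print(decode_count(encode_count(1023, 10)))  # largest number for n = 10
```

With n = 10 the largest number that can be held is 2¹⁰ − 1 = 1023; any larger count is rejected by the sketch.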


§5. PULSE LENGTHENER.


A pulse to be analysed will in general arrive at random with respect 
to the cycle of the memory. It can be recorded only at a time when 
the group of digits appropriate to the height is passing through the 
adding gate. This will occur at any time up to T₁ after its arrival.
Each pulse which is accepted by the selector is therefore made to charge 
a condenser through a diode to a potential equal to its height. The 
condenser retains its charge until the pulse has been recorded and is then 
discharged ; a well-known principle of pulse lengthening. But the 
requirements on the present circuit are unusually stringent. The 
potential must be held for a random time up to 1.2 milliseconds with a
precision of 0.2 V. The capacity, c, of the storage condenser is limited
to about 300 μμF. by the minimum length of the incoming pulses and the
total source impedance of the charging circuit. Hence the leakage current
must not exceed 0.06 μA. This is easily realized in the condenser itself,
and as it is charged by negative pulses and controls only a cathode 
follower, the anode and grid thus connected to the circuit do not give rise 
to an appreciable leakage current. In order to discharge the condenser, 
however, electrons must be removed from it, and if this were done by 
connecting a heated cathode to the system, the heater-cathode leakage 
resistance would be much too low. The method used is to heat the 
discharging cathode from an insulated source of A.C. power whose 
D.C. potential could follow that of the charged system. Two other 
alternative methods have been used with success. One is to use the 
secondary cathode of an electron multiplier valve, and the other is to 
discharge the system through a neon tube which is held in a non- 
conducting condition during the duration of the lengthened pulse. Of 
these, the first is less stable because a secondary cathode is capable of 
being triggered into a stable condition in which its potential relative to 
the primary cathode is too low to give a secondary emission ratio greater 
than unity ; it then ceases to act as a discharging cathode until it is reset 
externally into the operating range. This can be avoided or overcome,
but with the secondary emission valves at present available, the stable 
limits appear to be inconveniently near to the operating range in this 
circuit. 
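
The limit on the leakage current quoted above follows from the relation I = C ΔV / t for a condenser discharged at constant current. A minimal Python sketch of the arithmetic is given here; the function name is hypothetical, and the round figures from the text (300 μμF., 0.2 V, 1.2 milliseconds) are used, the capacity being stated only approximately.

```python
def max_leakage_current(capacity_farads, tolerance_volts, hold_seconds):
    """Largest constant leakage current that keeps the droop within tolerance.

    Uses I = C * dV / t for a condenser discharged at constant current.
    """
    return capacity_farads * tolerance_volts / hold_seconds


i_max = max_leakage_current(300e-12, 0.2, 1.2e-3)
print(f"{i_max * 1e6:.3f} microamp")
```

The nominal figures give about 0.05 μA, which is of the same order as the limit of 0.06 μA stated in the text.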


§6. SORTER. 


The general functional arrangement is shown in fig. 4. 

The potential of the cathode follower driven by the pulse lengthener
is compared with that of the output of the linear time base of fig. 2 by 
applying them to the grids of a double triode as shown in fig. 5. 



It is arranged that when the potentials of the two anodes become equal 
and B is becoming more positive, a trigger pair is caused to change its 
state and transmit an extended negative signal to the negative going 
coincidence circuit. The coincidence of this signal with the next signal 
from the channel pulse generator (waveform e on fig. 3) indicates the 
start of the group of digits forming the appropriate channel for recording 
the pulse. 


[Fig. 4. General functional arrangement of the sorter. Labelled blocks:
pulse lengthener, comparison, trigger pair, coincidence. Inputs: sweep
of fig. 3, channel pulse of fig. 3, signal shutting gate of fig. 1.
Output: add signal to adding gate.]


Although this sorting process is carried out by an elaborate
combination of valves, many of which can contribute to channel boundary
drift, it is clear that the same process is used to define all the channel
boundaries, so that common drift and not relative drift is the resultant
effect.


[Fig. 5. Comparison circuit. Labels: waveform a, waveform b, constant
current.]


If the rise time of the pulses being analysed is long, a pulse might 
be recorded before it had reached its full height through an intersection 
of waveforms in the comparison circuit of the form shown dotted 
in fig. 5. To prevent this, a third coincidence signal is introduced, which 
allows a pulse to be recorded only while the gate of fig. 1 is shut. 

Of the n positions in any one group passing through the adding gate
the first, if occupied, has the significance of 2⁰ pulses, and the last 2ⁿ⁻¹.
The process of adding "1" to a number already present in this form
may be stated in two equivalent ways. (a) Change the first digit. If
the result is 0, carry 1, i.e. change the next digit, and so on. (b) Change
all digits as they are presented until a 1 has been written, and then leave
the remainder unchanged. The first is the process normally used in
electronic calculating machines, and is at present employed. The second
involves an equally simple circuit, and has also been used. There is
little to choose in the relative merits of the two systems.
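
The two statements (a) and (b) of the adding process can be compared in a short Python sketch. This is an illustration only and not a description of the valve circuits; the function names are hypothetical. Digits are taken in the order in which they pass the adding gate, the first standing for 2⁰, and the second value returned indicates that the whole group was changed to 0 without a 1 being written.

```python
def add_one_with_carry(digits):
    """Method (a): change the first digit; if the result is 0, carry on."""
    out = list(digits)
    for k in range(len(out)):
        out[k] ^= 1
        if out[k] == 1:      # result is 1, so no carry is needed
            return out, False
    return out, True         # the carry passes out of the group


def add_one_until_one_written(digits):
    """Method (b): change each digit as presented until a 1 has been written."""
    out = []
    one_written = False
    for d in digits:
        if one_written:
            out.append(d)            # leave the remainder unchanged
        else:
            out.append(d ^ 1)
            one_written = (d == 0)   # a 0 changed to 1 ends the process
    return out, not one_written


print(add_one_with_carry([1, 1, 0, 1]))         # 11 + 1 = 12
print(add_one_until_one_written([1, 1, 0, 1]))
print(add_one_with_carry([1, 1, 1, 1]))         # full group is cleared
```

Both functions give the same result for every pattern. A full group is returned as all zeros with the second value true, which corresponds to the clearing of a full channel described in §11.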


§7. PREAMPLIFIER, BIASED AMPLIFIER, GATE AND SELECTOR. 
The two amplifiers are negative feed-back rings of three valves having
a high loop gain so as to ensure linearity and stability. Their gain is
determined by the ratio of high stability carbon resistances. The bias
is external to the amplifiers, and is determined by two similar pentodes
forming a long-tailed pair. The pulses are negative when passing through
the gate, which has the form of a double cathode follower. The
functional arrangement of the selector is shown in fig. 6.

[Fig. 6. Functional arrangement of the selector. Labelled blocks: upper
discriminator, discriminator, coincidence, combine (either/or), trigger
pair, differentiate tail. Signals: shut gate, signal from gate, release
pulse lengthener, reset pulse lengthener, add signal of fig. 4.]

A pulse falling within the height range between the lower and upper
limits will change the lower discriminator from its quiescent state A to
its state B. This in turn changes the state of the trigger pair from
state A to B, and thus removes the standing discharging current from
the pulse lengthener. When the pulse dies away to a level below the
lower limit, the discriminator returns to state A and a coincidence is
formed which closes the gate. The gate is reopened and the pulse
lengthener is reset by a signal from the sorter that the pulse has been
recorded.



A pulse which passes above both limits resets the trigger pair before 
the lower limit has returned to state A. The gate is therefore not shut 
and no pulse is recorded. The act of resetting the trigger pair or resetting 
the lower limit, whichever occurs the later, resets the pulse lengthener. 
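
The logic of the selector described above may be summarized in the following Python sketch. It is a simplified model made for illustration, with hypothetical names: the pulse is represented by a list of sampled magnitudes (the actual pulses are negative in the gate), and the two discriminators and the trigger pair are reduced to two Boolean states.

```python
def selector_accepts(samples, lower, upper):
    """Return True if the gate is shut, i.e. the pulse is passed for recording.

    lower_in_b : lower discriminator is in its state B
    pair_in_b  : trigger pair is in its state B
    """
    lower_in_b = False
    pair_in_b = False
    for v in samples:
        if not lower_in_b and v > lower:
            lower_in_b = True    # lower discriminator changes from A to B
            pair_in_b = True     # which sets the trigger pair to B
        if pair_in_b and v > upper:
            pair_in_b = False    # upper discriminator resets the trigger pair
        if lower_in_b and v <= lower:
            lower_in_b = False   # lower discriminator returns to A
            if pair_in_b:
                return True      # coincidence formed: gate shut
            # trigger pair already reset: no coincidence, pulse rejected
    return False


print(selector_accepts([0, 3, 5, 3, 0], lower=2, upper=6))  # within the limits
print(selector_accepts([0, 3, 8, 3, 0], lower=2, upper=6))  # above both limits
print(selector_accepts([0, 1, 0], lower=2, upper=6))        # below the lower limit
```

Only the first of the three pulses is accepted, since it alone exceeds the lower limit without reaching the upper one.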


§8. DISPLAY. 

The functional diagram is shown in fig. 7. 

A raster of faint vertical lines is produced on a cathode ray tube, each
vertical sweep occurring during the passage of the group of positions
corresponding to one channel through the adding gate. Pulses from
the coincidence unit of fig. 2 are used to brighten the trace, so that each
vertical line contains a series of bright dots and faint spaces respectively
for the digits 1 and 0. The digits 2⁰ are arranged to occur at the bottom
of each line. It is a convenience to allow the 1 μsec. pulses to make a
slight modulation of the brightness even in the absence of a "1", since
the power of 2 corresponding to any one digit can then more easily be
read at a glance.

[Fig. 7. Functional diagram of the display. Labelled blocks: symmetrical
triggered time base, anode follower. Inputs: channel pulse of fig. 2,
linear sweep of fig. 2.]

It would be possible to arrange also for a linear display of the spectrum 
of pulses, but this has not so far been included. The present display 
gives a very clear picture of what is going on during the course of an 
experiment. The height of the uppermost dot in each column gives a 
rough visual indication of the number of pulses on a logarithmic scale, 
and with a little practice it soon becomes possible to detect more detailed 
aspects of the spectrum by regular patterns in the lower dots of the 
columns. The information can either be decoded directly from the 
screen or may be traced or photographed. With the aid of a slide rule 
designed for the purpose by Professor Frisch, the time taken to decode 
a complete spectrum is about ten minutes. 
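
The logarithmic indication given by the uppermost dot can be illustrated by a short Python sketch. The function names are hypothetical, and the column is taken to have ten positions with the digit 2⁰ at the bottom, as described above.

```python
def column_from_count(count, n):
    """Dots in one display column, bottom (2**0) first: True = bright dot."""
    return [bool((count >> k) & 1) for k in range(n)]


def uppermost_dot(column):
    """Position of the highest bright dot, or None for an empty channel."""
    lit = [k for k, dot in enumerate(column) if dot]
    return lit[-1] if lit else None


for count in (1, 5, 40, 700):
    top = uppermost_dot(column_from_count(count, 10))
    # 2**top <= count < 2**(top + 1): top is the integer part of log2(count)
    print(count, top)
```

A channel whose uppermost dot stands at position k therefore holds between 2ᵏ and 2ᵏ⁺¹ − 1 pulses, which is the rough visual indication referred to in the text.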


§9. CALIBRATION PULSE GENERATOR.


The principle of the generator is that a standard current is switched
from one anode to the other of a long-tailed pair of triodes. The size
of the voltage pulses produced is determined by a variable resistance
network in the anode circuit of one of the valves. Provision is made for
producing either short pulses for calibration of the analyser directly, or
a square waveform of unit mark-space ratio intended for attenuation
and injection into the input of an external amplifying system. Pulse
repetition rates of 50 and 500 per sec. are provided and individual pulses
can also be produced by a press button.

The maximum pulse size is 50 volts adjusted by switches in steps of 
10 volts and 1 volt, and by a potentiometer, continuously up to 1 volt. 
There is an independent attenuator giving multiplying factors of 
1, 10⁻¹, 10⁻², 10⁻³ and 10⁻⁴.


§10. SPEED. 


The average time taken to record a pulse is ½T₁. Hence the maximum
possible counting rate in the present machine is about 1600 per sec.
If pulses are presented with a frequency comparable to this, the analyser
selects a random sample for recording, but since the sampling is not
dependent on pulse height, the spectrum of pulses is not distorted.
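
The figure for the maximum counting rate can be checked by a line or two of arithmetic, shown here as a Python sketch. The variable names are hypothetical, and the assumption is made that a pulse waits on the average half a cycle of the memory for its channel to reach the adding gate.

```python
T1 = 1.2e-3                    # delay-line transit time in seconds (from the text)
mean_record_time = T1 / 2      # assumed mean wait for a randomly arriving pulse
max_rate = 1 / mean_record_time

print(round(max_rate))         # pulses per second
```

With T₁ = 1.2 milliseconds the rate comes to rather more than 1600 per sec., in agreement with the round figure quoted.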

The minimum acceptable pulse length is limited by two factors :— 


(a) the charging impedance of the pulse lengthener ; 
(b) the short-period overshoot occurring in the negative feed-back
amplifiers. 


If the pulse length is reduced to 31 psec., the error from these causes 
is already of the order of 0.1 per cent. But if the input pulses are made
longer than 1 μsec. and the counting rate is made more than 1000/sec.,
an error of more than 0.1 per cent will arise from the absence of D.C.
restoration, which would be difficult to introduce before the biased 
amplifier in the presence of noise. The combination of these effects, 
therefore, sets an upper limit to the speed which is independent of the 
time taken to record a pulse. 


§11. STORAGE CAPACITY AND NUMBER OF CHANNELS.


The number, n, of positions of binary digits in each channel can be
varied at will by changing the frequency of the Channel Frequency
Pulsed Oscillator of fig. 2. Three settings are provided: 120 channels
each capable of recording 1023 pulses; 80 channels of up to more than
30,000 pulses; or 60 channels of up to more than 10⁶ pulses each.
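
The three settings can be verified from the number of signal positions in the line. The Python sketch below assumes exactly 1200 positions (the text gives "about 1200") shared equally between the channels; the names are hypothetical.

```python
POSITIONS = 1200  # assumed exact; the text gives "about 1200" positions


def channel_capacity(channels, positions=POSITIONS):
    """Return (n, largest recordable number) for a given number of channels."""
    n = positions // channels
    return n, 2**n - 1


for channels in (120, 80, 60):
    n, capacity = channel_capacity(channels)
    print(channels, n, capacity)
```

This gives n = 10, 15 and 20 digits per channel, with capacities of 1023, 32,767 and 1,048,575 pulses respectively, in agreement with the figures stated.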

If a channel becomes full, the analyser clears it, and incidentally adds 
one pulse into the next higher channel. By noting the number of times 
that a given channel is cleared during a run, it is therefore possible to use 
the analyser, if necessary, for much longer runs than would be indicated 
by its storage capacity. 
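
One way in which the count for a long run might be reconstructed is sketched below in Python. The formula is an inference from the preceding paragraph and is not given by the authors: it assumes that a channel is cleared when one is added to a channel already holding 2ⁿ − 1, so that each clearing stands for 2ⁿ entries, and that each clearing of the channel below has added one spurious entry. The function and parameter names are hypothetical.

```python
def true_count(stored, times_cleared, n, cleared_below=0):
    """Estimated number of pulses actually sorted into a channel.

    stored        : number read from the channel at the end of the run
    times_cleared : number of times this channel was cleared during the run
    n             : number of binary digits per channel
    cleared_below : number of times the next lower channel was cleared,
                    each clearing having added one entry to this channel
    """
    return times_cleared * 2**n + stored - cleared_below


print(true_count(stored=23, times_cleared=3, n=10))
print(true_count(stored=23, times_cleared=3, n=10, cleared_below=2))
```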


§12. ALTERNATIVE MEMORY SYSTEMS.


Other types of short-term memory (see References 20-23) such as
a cathode-ray tube memory, a magnetic disc or tape, or an electrostatic
disc could have been used instead of a supersonic delay line. The delay
line was used because of its great convenience, simplicity and reliability,
together with the fact that its most convenient range of delay times gives
a suitable compromise between storage capacity and mean recording
time.


ACKNOWLEDGMENT.


We wish to express our thanks to Mr. T. Gold for designing for us the 
very simple and efficient supersonic delay line used in the analyser. 

One of us (G.W.H.) wishes to thank the administrators of the 
University of Cambridge Clerk Maxwell Scholarship Fund for financial 
support during the course of the work. 


REFERENCES. 


Annals of the Computation Laboratory of Harvard University, Vol. 16,
pp. 103, 125, 133.

Baldinger, E., and Casale, R., 1949, Helv. Phys. Acta, 21, 172.

Booth, 1949, Electronic Engineering.

Cooke-Yarborough, E. H., Bradwell, J., Florida, C. D., and
Howells, C. A., 1950, Proc. Inst. Electr. Eng., Pt. II, 46, 108.

Eppstein, J. S., 1951, J. Sci. Instrum., 28, 41.

Freundlich, H. F., 1948, Brit. Atomic Energy Project.

Freundlich, H. F., Hincks, E. P., and Ozeroff, W. J., 1947, Rev. Sci.
Instrum., 18, 90.

Frisch, O. R., 1943, Brit. Atomic Energy Project.

Frisch, O. R., in course of publication; see also 1948, Discovery, 9, 236.

Frisch, O. R., 1951, Phil. Mag., 42, in course of publication.

Glenn, W. E., 1949, Nucleonics, 4, 6, 50.

Gold, T., 1951, Phil. Mag., 42, 787.

Gray, W. J., 1948, Appendix C to "Vacuum Tube Amplifiers" by
G. E. Valley and H. Wallman.

Kahan, T., and Kwartiroff, A., 1946, J. Phys. Radium, 7, 300; Ibid., 7, 357.

Maeder, D., 1947, Helv. Phys. Acta, 20, 139.

Maeder, D., Huber, P., and Stebler, 1947, Helv. Phys. Acta, 20, 230.

Poole, M. J., 1944, Brit. Atomic Energy Report (May).

Roberts, A., 1940, Rev. Sci. Instrum., 11, 44.

Saye, E. A., 1944, Brit. Atomic Energy Report (Jan.).

Selenyi, P., 1935, Zeits. f. Tech. Physik, 16, 607; 1936, Ibid., 17, 477.

Turton, J., 1948, C.R. Acad. Sci., Paris, 226, 706.

Watkins, D. A., 1949, Rev. Sci. Instrum., 20, 495.

Westcott, C. H., and Hanna, G. C., 1949, Rev. Sci. Instrum., 20, 181.

Wilkinson, D. H., 1949, Proc. Camb. Phil. Soc., 46, 508.

Williams, F. C., and Kilburn, T., 1949, Proc. Inst. Electr. Eng., Pt. II,
96, 81.


APPENDIX.


The following diagrams show the circuit details of the first model. 
This was designed to work with existing power supplies giving +300 V. 
and —200 V. stabilized lines. The power supplies for subsequent models 
have been designed to give three stabilized lines, and this has made
possible some simplifications and improvements in the design. 



Resistances are given in kilo-ohms except where otherwise stated. 

The capacities of all condensers lie between 5 μμF. and 1 μF. and are
stated in either μμFarads or μFarads. The units are often omitted since
no confusion can arise. To avoid confusing the diagrams, many 
decoupling condensers have been omitted from circuit points which carry 
only D.C. reference potentials. 


[Fig. 1. Circuit diagram. Labelled sections: calibration pulse generator,
pre-amplifier, amplifier. Other labels: external link for external meter,
test pulse output, analyser input, from input, calibration, −200 v.,
−400 v.]


The following valves are used except where otherwise stated :—


Single diodes ................................ EA50
Double diodes ................................ VR54
Double triodes ............................... ECC33
Pentodes ..................................... EF50 and EF55
Secondary Emission Valve ..................... E2133
Neon (pulse lengthener) ...................... CV91
Selenium Rectifier) .22 5 02. <2. eons ae eee per sir es A 
[Fig. 3. Circuit diagram. Labelled sections: gate, lower limit, upper
limit, trigger pair, pulse lengthener, comparison, trigger pair,
coincidence. Other labels: to comparison, add signal to adding gate,
−ve channel pulse, switch, −200 v., −400 v.]


The terminal links in the resistance networks of the bias and 
calibration pulse circuits are provided for the insertion of an external 
milli-ammeter. 


The switch defining potential C is ganged with the gain control of the 
post-bias amplifier. 


[Circuit diagram. Labelled sections: 13 Mc/sec. amplifier, rectifier,
adding gate, 13 Mc/sec. transmitter, linear sweep, channel frequency
oscillator, 1 Mc/sec. oscillator. Other labels: from supersonic delay
line, 1 Mc/sec. square waveform of current, to display brightness,
−ve channel pulse to coincidence, initial pulse to adding gate, number
of channels selector, +ve channel pulse to display, output, square
waveform of current to adding gate, −200 v., −400 v.]

[Circuit diagram. Labelled sections: linear sweep, display "Y" time base,
amplifier, cathode ray tube power supply.]



LXXXII. CORRESPONDENCE. 


On a Generalization of Wilson's Hypothesis. 


By GEORGE LUCHAK,
Defence Research Board Experimental Station, Ralston, Alberta*. 


[Received April 9, 1951.] 


In his paper "A 4-dimensional Generalization of Wilson's Hypothesis",
Dr. A. Papapetrou (1950) postulates that to every mass element at the
point xⁱ with 4-velocity uⁱ there corresponds another mass element called
its "gravitational centre" at Xⁱ with 4-velocity Uⁱ. An observer A
measures uⁱ at xⁱ and, since, according to Papapetrou, he can find Xⁱ by
solving the equations of the gravitational field, he can find Uⁱ. Now he
is asked to form uⁱ − Uⁱ. It is not clear how Papapetrou can possibly
avoid violating the Special Theory of Relativity by such a procedure.
At what times are uⁱ and Uⁱ to be measured? If they are to be measured
simultaneously by A, then a Lorentz transformation indicates that
another observer A′ would measure u′ⁱ at x′ⁱ and U′ⁱ at X′ⁱ. However,
it is obvious that in general x′⁴ ≠ X′⁴; i.e. the measurements by the
new observer are not made simultaneously for the two particles. Thus,
the concept of simultaneously timed measurements of the two 4-vectors
is not permissible if relativistic covariance is required. However, if these
measurements are not to be made simultaneously, how are they to
be made? How can Papapetrou avoid giving one particular inertial
frame preference over the rest in specifying any possible scheme in
place of simultaneity? In essence, the postulate which Papapetrou
introduces to displace the Wilson Hypothesis appears to be a retrograde
step to the Newtonian idea of action at a distance.
Papapetrou introduces a "quantity" w satisfying the equation


ae 
[ ]w = oat ’ ° . . ° . . . ° (1) 
where 
ale), a at es + (2) 
and then writes . 
0 
pt ae = PH (pip eee ot, - (3) 


where ρ is the mass density, G is the gravitational constant, ηᵢₖ is the
metric tensor for flat space, sᵢ is the postulated current to replace the
Wilson current and uⁱ and Uⁱ are defined as above. Since no boundary
conditions are stipulated to restrict the generality of solutions of


* Communicated by the Author, 



equation (1), it is evident that even if pⁱ be obtainable in principle,
∂w/∂xⁱ will not be uniquely determined for any prescribed physical
conditions by solving equation (1). In Papapetrou's first example of
the use of his postulate, he concludes that w = 0 from


$$\Box w = 0, \qquad\qquad (4)$$


and thus appears to demonstrate that equation (3) reduces to the Wilson 
expression. It is obvious, however, that equation (4) has an infinity of 
solutions any one of which might have been used. Papapetrou’s other 
examples all suffer from the same arbitrary use of particular solutions 
with no justification for his choice. 

Finally it is worth noting that, according to Papapetrou's postulate,
finite displacements of the "gravitational centre" may occur for
infinitesimal displacements of the mass element and consequently, a
discontinuity in the current density sᵢ may take place while the
gravitational field, presumably responsible for the current, is zero in
value and changing in a continuous fashion. This does not seem physically
plausible to the writer. Thus Papapetrou's generalization

(a) violates the Special Theory of Relativity,

(b) gives an infinity of solutions for any prescribed physical situation,

and
(c) in particular, does not reduce uniquely to Wilson's Hypothesis.


Any attempt to patch up the theory to eliminate (b) will probably require 
some specification of the physical significance of w but such an attempt 
seems pointless in view of (a). 


REFERENCE. 
Papapetrou, A., 1950, Phil. Mag. [7] 42, 399.


[The Editors do not hold themselves responsible for the views
expressed by their correspondents.]


